Reputation: 2415
I'm looking for a parser for the wiki markup language used by Wikipedia that can convert wiki markup text into a parse tree of Java objects. I've come across a few parsers, but they convert the markup into HTML rather than a tree, for example:
java-wikipedia-parser
Mylyn WikiText
Upvotes: 1
Views: 316
Reputation: 101
WikiText isn't really set up to be parsed in this way.
What you might consider doing is looking at Parsoid – it generates HTML with sufficient annotations that you could convert it into a parse tree.
Otherwise, MediaWiki.org has a page about alternative parsers. It's probably hopelessly out of date, though.
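To give a rough idea of the Parsoid route, here is a minimal sketch that walks annotated HTML and builds a tree of Java objects. It rests on several assumptions: that the annotations arrive as `rel="mw:..."` and `typeof="mw:..."` attributes (check the Parsoid DOM spec for the actual set), that the input is well-formed XML so the JDK's built-in DOM parser can read it (real Parsoid output is HTML5, so an HTML5 parser would be needed in practice), and that `WikiNode` and `ParsoidTreeSketch` are hypothetical names invented for this example.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class ParsoidTreeSketch {

    /** Hypothetical parse-tree node for illustration; not part of Parsoid or any library. */
    static final class WikiNode {
        final String kind;   // e.g. "WikiLink", "p", "text"
        final String value;  // link target, raw data-mw payload, or text; empty otherwise
        final List<WikiNode> children = new ArrayList<>();

        WikiNode(String kind, String value) {
            this.kind = kind;
            this.value = value;
        }
    }

    /** Parses well-formed (XML-compatible) annotated HTML into a WikiNode tree. */
    static WikiNode parse(String xhtml) throws Exception {
        Element root = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xhtml)))
                .getDocumentElement();
        return convert(root);
    }

    private static WikiNode convert(Element el) {
        // Assumed annotation attributes; getAttribute returns "" when absent.
        String rel = el.getAttribute("rel");
        String type = el.getAttribute("typeof");
        WikiNode node;
        if (rel.startsWith("mw:")) {
            node = new WikiNode(rel.substring(3), el.getAttribute("href"));
        } else if (type.startsWith("mw:")) {
            node = new WikiNode(type.substring(3), el.getAttribute("data-mw"));
        } else {
            // No annotation: fall back to the plain tag name.
            node = new WikiNode(el.getTagName(), "");
        }
        NodeList kids = el.getChildNodes();
        for (int i = 0; i < kids.getLength(); i++) {
            Node kid = kids.item(i);
            if (kid.getNodeType() == Node.ELEMENT_NODE) {
                node.children.add(convert((Element) kid));
            } else if (kid.getNodeType() == Node.TEXT_NODE) {
                node.children.add(new WikiNode("text", kid.getNodeValue()));
            }
            // Comments and other node types are ignored in this sketch.
        }
        return node;
    }

    private static void print(WikiNode node, String indent) {
        String suffix = node.value.isEmpty() ? "" : " [" + node.value + "]";
        System.out.println(indent + node.kind + suffix);
        for (WikiNode child : node.children) {
            print(child, indent + "  ");
        }
    }

    public static void main(String[] args) throws Exception {
        // Hand-written sample, not actual Parsoid output.
        String sample = "<p>See <a rel=\"mw:WikiLink\" href=\"./Java\">Java</a> now.</p>";
        print(parse(sample), "");
    }
}
```

The `data-mw` payload is kept as a raw string here; a fuller version would presumably decode it with a JSON library and map each annotation type to its own node class.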
Upvotes: 2