Flak
Flak

Reputation: 2670

How to extract and import Wikipedia pages?

I'm building a search engine, and to test it well, it needs more articles. Best source for them is Wikipedia.

I have searched for some dumps, but some are XML (which I am having troubles to import), some are not with content there.

So, how to get a dump, preferably in MySQL form. It has to be a non-English language.

Any idea?

Upvotes: 0

Views: 2687

Answers (1)

bmargulies
bmargulies

Reputation: 100050

Here is a page explaining how to import Wikipedia to Solr.

Here is a step-by-step explanation of loading a Wikipedia dump into Mysql to run a local clone.

Upvotes: 3

Related Questions