Reputation: 41
i thought that the wiki dump XML file TAGS will be like
<page>
<title> </title>
<content> </content>
</page>
<page>
<title> </title>
<content> </content>
</page>
in addition to other tags. i managed to find the page and title tags, but i still can not find where the main article is, in body tag, or content tag, or article tag, any help
Upvotes: 2
Views: 417
Reputation: 417
The main article will be inside tag <page>
,then<revision>
and inside them search for <text>
Upvotes: 4