avinash

Reputation: 147

How can I import XML data into Hadoop?

I am quite new to Hadoop and want to import semi-structured XML data into HDFS. What are the ways to import XML data from a remote location into HDFS, and are there any open source tools for it? Can Flume import XML data into HDFS? Thanks in advance.

Upvotes: 3

Views: 1954

Answers (1)

Jinith

Reputation: 438

You could use the HDFS Java API to create files in HDFS and write the full content of each XML document into its own file.
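A minimal sketch of that approach, assuming the Hadoop client libraries are on the classpath. The source URL, the NameNode address `hdfs://namenode:8020`, and the target path are placeholders I made up for illustration, so adjust them to your cluster:

```java
import java.io.InputStream;
import java.net.URI;
import java.net.URL;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

public class XmlToHdfs {
    public static void main(String[] args) throws Exception {
        // Hypothetical values: replace with your own source, NameNode, and target path.
        String sourceUrl = "http://example.com/data/sample.xml";
        String hdfsUri = "hdfs://namenode:8020";
        Path target = new Path("/data/xml/sample.xml");

        Configuration conf = new Configuration();
        try (FileSystem fs = FileSystem.get(URI.create(hdfsUri), conf);
             InputStream in = new URL(sourceUrl).openStream();
             FSDataOutputStream out = fs.create(target, true)) { // true = overwrite if present
            // Stream the XML bytes unchanged into the HDFS file, 4 KB at a time.
            // The streams are closed by try-with-resources, so pass close = false.
            IOUtils.copyBytes(in, out, 4096, false);
        }
    }
}
```

If the XML files are already on a local disk, `fs.copyFromLocalFile(new Path(localFile), target)` should do the same job without the manual stream handling.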

Yes, you could also use Flume. It makes sense if the source generates a large number of XML files that need to be delivered to HDFS.
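As a rough sketch, a Flume agent for this could pair a spooling directory source with an HDFS sink. The agent and component names (`agent1`, `xmlSrc`, `memCh`, `hdfsSink`), the spool directory, and the HDFS path below are assumptions for illustration only:

```properties
# Hypothetical agent layout: one source, one channel, one sink.
agent1.sources = xmlSrc
agent1.channels = memCh
agent1.sinks = hdfsSink

# Watch a local directory where the XML files are dropped.
agent1.sources.xmlSrc.type = spooldir
agent1.sources.xmlSrc.spoolDir = /var/spool/xml
agent1.sources.xmlSrc.channels = memCh

# Buffer events in memory between source and sink.
agent1.channels.memCh.type = memory
agent1.channels.memCh.capacity = 10000

# Write the events to HDFS as a plain, uncompressed stream.
agent1.sinks.hdfsSink.type = hdfs
agent1.sinks.hdfsSink.hdfs.path = hdfs://namenode:8020/data/xml
agent1.sinks.hdfsSink.hdfs.fileType = DataStream
agent1.sinks.hdfsSink.channel = memCh
```

Keep in mind that the spooling directory source emits one event per line by default, so a multi-line XML document will be split into several events unless you configure a different deserializer.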

You can have a look at this link: http://www.dummies.com/how-to/content/log-data-with-flume-in-hdfs.html

Upvotes: 1
