Reputation: 71
I am using MapReduce to process XML files. I wrote a custom InputFormat by following the code at http://thinkbigdataanalytics.com/xmlinputformat-hadoop/. The job succeeds on plain XML files, but when I run the same code on gzip files containing XML, the map method is never invoked.
The job counters report:
Map-Reduce Framework
Map input records=0
I have a few log statements inside map(), and they do not show up either. Please suggest a solution!
Upvotes: 0
Views: 44
Reputation: 71
The solution was to look up the file's compression codec with the CompressionCodecFactory class and read the input through the matching codec instead of reading the raw bytes. Please refer to http://www.javased.com/index.php?source_dir=Cloud9/src/dist/edu/umd/cloud9/collection/XMLInputFormat.java
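To illustrate the idea without a Hadoop cluster, here is a minimal stdlib-only sketch. It is not the code from the linked XMLInputFormat: `CodecAwareXmlReader`, `openDecoded`, `readRecords` and the `<doc>` tags are hypothetical names made up for this example, and `java.util.zip.GZIPInputStream` stands in for the stream a Hadoop codec would provide. In a real record reader the equivalent step would be roughly `new CompressionCodecFactory(conf).getCodec(path)` followed by `codec.createInputStream(...)` when the codec is not null. The likely reason for `Map input records=0` is that the reader scans the compressed bytes for the start tag and never finds it.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

class CodecAwareXmlReader {

    // Stand-in for Hadoop's codec lookup: choose a decompressor from the
    // file extension, or fall back to the raw stream for plain files.
    static InputStream openDecoded(String fileName, InputStream raw) throws IOException {
        if (fileName.endsWith(".gz")) {
            return new GZIPInputStream(raw);
        }
        return raw;
    }

    // Reads the whole (decoded) input and returns every startTag...endTag span.
    // A real record reader would stream instead of buffering everything.
    static List<String> readRecords(String fileName, InputStream raw,
                                    String startTag, String endTag) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (InputStream in = openDecoded(fileName, raw)) {
            byte[] chunk = new byte[4096];
            int n;
            while ((n = in.read(chunk)) != -1) {
                buf.write(chunk, 0, n);
            }
        }
        String text = new String(buf.toByteArray(), StandardCharsets.UTF_8);

        List<String> records = new ArrayList<>();
        int from = 0;
        while (true) {
            int start = text.indexOf(startTag, from);
            if (start < 0) break;
            int end = text.indexOf(endTag, start);
            if (end < 0) break;
            end += endTag.length();
            records.add(text.substring(start, end));
            from = end;
        }
        return records;
    }

    // Helper for the demo only: gzip a byte array in memory.
    static byte[] gzip(byte[] data) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(out)) {
            gz.write(data);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] xml = "<docs><doc>a</doc><doc>b</doc></docs>"
                .getBytes(StandardCharsets.UTF_8);
        // Plain and gzipped inputs should yield the same records.
        System.out.println(readRecords("sample.xml",
                new ByteArrayInputStream(xml), "<doc>", "</doc>"));
        System.out.println(readRecords("sample.xml.gz",
                new ByteArrayInputStream(gzip(xml)), "<doc>", "</doc>"));
    }
}
```

Note that a gzip file cannot be split, so in the Hadoop version the input format should probably also return false from isSplitable for compressed inputs, so that each .gz file goes to a single mapper.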
Upvotes: 0