Naveen Srinivas
Naveen Srinivas

Reputation: 71

Map method skipped when processing Gzip files

I am working with Mapreduce to process Xml files. I have written Custom Input Format by following the code in http://thinkbigdataanalytics.com/xmlinputformat-hadoop/. When I run the job, it succeeds for XML Files. But when I run the same code for gzip files containing Xmls, the map method is not even invoked.

It says,

Map-Reduce Framework

Map input records=0 

I have a few Loggers inside map() and they are not showing up too. Please suggest a solution!

Upvotes: 0

Views: 44

Answers (1)

Naveen Srinivas
Naveen Srinivas

Reputation: 71

The Solution was to get the file codec and use the appropriate compression codec using the CompressionCodecFactory Class. Please Refer http://www.javased.com/index.php?source_dir=Cloud9/src/dist/edu/umd/cloud9/collection/XMLInputFormat.java

Upvotes: 0

Related Questions