Freya Ren

Reputation: 2164

Can I make my Hadoop reducer quicker?

I'm new to Hadoop and am just trying the wordcount example. I built a single-node cluster following http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/

I uploaded a very simple text file with a few words to HDFS and ran wordcount.jar.

Somehow the reducer takes a very, very long time to run. I know I/O is the bottleneck, but are there any parameters I can set to make it faster? (The reduce phase is still at 0% after almost 20 minutes.)

13/06/04 15:53:14 INFO mapred.JobClient:  map 100% reduce 0%

Upvotes: 0

Views: 271

Answers (2)

Vbp

Reputation: 1982

If you want to modify Hadoop settings, such as increasing the number of reduce tasks, you can use the "-D" option:

hduser@ubuntu:/usr/local/hadoop$ bin/hadoop jar hadoop*examples*.jar wordcount -D mapred.reduce.tasks=8 /user/hduser/temp-data /user/hduser/temp-data-output

Moreover, with HDFS you cannot force the number of map tasks via mapred.map.tasks, but you can specify mapred.reduce.tasks, as explained in this link.
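
For completeness: if you write your own driver class instead of running the bundled examples jar, you can fix the number of reduce tasks in code with job.setNumReduceTasks(). The sketch below follows the standard WordCount structure on the Hadoop 1.x org.apache.hadoop.mapreduce API (the class name WordCountDriver and the value 8 are just illustrative); job.setNumReduceTasks(8) has the same effect as passing -D mapred.reduce.tasks=8 on the command line.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCountDriver {

  // Emits (word, 1) for every token in the input line.
  public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
    private final static IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, ONE);
      }
    }
  }

  // Sums the counts for each word; also reused as a combiner.
  public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    private final IntWritable result = new IntWritable();

    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = new Job(conf, "word count");      // Job.getInstance(conf, ...) on newer Hadoop
    job.setJarByClass(WordCountDriver.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class);  // combiner cuts shuffle traffic to the reducer
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);

    // Programmatic equivalent of -D mapred.reduce.tasks=8.
    job.setNumReduceTasks(8);

    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}

Keep in mind that on a tiny single-node input this mainly demonstrates the knob; if reduce is genuinely stuck at 0%, the checks in the other answer are the first thing to look at.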

Upvotes: 1

Kun Ling

Reputation: 2219

It seems your Hadoop setup has some issues, and the MapReduce job could not run correctly.

Please check:

  1. Check whether Hadoop is working correctly by accessing http://localhost:50030, which is the JobTracker web UI.
  2. Look into the log files under $HADOOP_HOME/logs/, especially *jobtracker*.log and *tasktracker*.log.

If it is your first time testing Hadoop, please also check this link: Hadoop WordCount example stuck at map 100% reduce 0%

Upvotes: 0
