Reputation: 669
I have a task that is designed to run dozens of map/reduce jobs. Some of them are IO-intensive, some are mapper-intensive, some are reducer-intensive. I would like to monitor the number of mappers and reducers currently in use so that, when a set of mappers frees up, I can push another mapper-intensive job to the cluster. I don't want to simply stack them up on the queue, because they might clog up the mappers and keep the reducer-intensive jobs from running.
Is there a command line interface I can call to get this information from (for instance) a Python script?
Upvotes: 0
Views: 1869
Reputation: 669
I discovered that

    mapred job -list

will list all of the jobs currently running, and

    mapred job -status <job_id>

will report the number of mappers and reducers for each job.
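For example, here is a minimal Python sketch that shells out to those two commands. The exact text that mapred job -status prints varies by Hadoop version, so the "Number of maps" / "Number of reduces" patterns below are assumptions you may need to adjust for your cluster:

    import re
    import subprocess

    def running_job_ids():
        """Return the job IDs reported by `mapred job -list`."""
        out = subprocess.check_output(["mapred", "job", "-list"], text=True)
        # Job IDs look like job_1400000000000_0001; pattern assumed from Hadoop 2.x output.
        return re.findall(r"\bjob_\d+_\d+\b", out)

    def map_reduce_counts(job_id):
        """Return (num_maps, num_reduces) parsed from `mapred job -status`."""
        out = subprocess.check_output(["mapred", "job", "-status", job_id], text=True)
        maps = re.search(r"Number of maps:\s*(\d+)", out)
        reduces = re.search(r"Number of reduces:\s*(\d+)", out)
        return (int(maps.group(1)) if maps else None,
                int(reduces.group(1)) if reduces else None)

    for job_id in running_job_ids():
        print(job_id, map_reduce_counts(job_id))

Polling this in a loop lets you wait until map slots free up before submitting the next mapper-intensive job.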
Upvotes: 0
Reputation: 4372
Hadoop job status can be accessed in the following ways.
Hadoop jobs can be administered through the Hadoop web UI.
The JobTracker shows job details; its default port is 50030 (localhost:50030 in pseudo-distributed mode).
TaskTrackers show the individual map/reduce tasks and are available at the default port 50060.
Hadoop also provides a REST API to access the cluster, nodes, applications, and application history.
This REST API can also be called from a Python script to get the application status. http://hadoop.apache.org/docs/r2.4.1/hadoop-yarn/hadoop-yarn-site/WebServicesIntro.html
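For example, a minimal Python sketch against the ResourceManager's REST endpoints (rm-host:8088 is an assumed address; 8088 is the ResourceManager's default web port):

    import json
    from urllib.request import urlopen

    # Assumed ResourceManager address; adjust the host for your cluster.
    RM = "http://rm-host:8088"

    # List the applications currently running on the cluster.
    with urlopen(RM + "/ws/v1/cluster/apps?states=RUNNING") as resp:
        data = json.load(resp)

    # "apps" is null in the response when nothing matches.
    apps = (data.get("apps") or {}).get("app", [])
    for app in apps:
        print(app["id"], app["name"], app["state"])

    # Cluster-wide resource usage is exposed by the metrics endpoint.
    with urlopen(RM + "/ws/v1/cluster/metrics") as resp:
        metrics = json.load(resp)["clusterMetrics"]
    print("apps running:", metrics["appsRunning"],
          "containers allocated:", metrics["containersAllocated"])

The per-application responses also include progress and resource usage, which is enough information to decide when to submit the next job.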
Upvotes: 2