Reputation: 251
Can any one help me in configuring a work flow with Map-Reduce action, that takes multiple input paths each input path is associated to one Mapper as like MultipleInputs.addInputPath api takes input path and a mapper. The out put of these mappers will be given to reducer.
I tried this with java action, but it will execute only one map task. But here input path contains huge data, so java action will not us in this case.
Is there any way in handling this case?
Regards, Krish.
Upvotes: 1
Views: 1785
Reputation: 443
In the workflow you can give a comma separated list of input directories in mapred.input.dir
. This will make the files in those directories to run on different mappers.
Upvotes: 1