Reputation: 63
Is it possible to configure hadoop streaming to read two or more input arguments at runtime for a job?
For example, let's say I have a script which is executed as: my_script file1 file2
How can I specify this in hadoop streaming?
As far as I know, I can only specify jobs which have the following execution syntax:
my_script "fixed_params" "input"
.
Upvotes: 2
Views: 1026
Reputation: 841
Haven't worked in streaming much, but I'm pretty sure you can just add another -input
argument.
Also see: Using multiple mapper inputs in one streaming job on hadoop?
Upvotes: 1