Reputation: 23
so I'm new in this Hadoop world and I'm trying to understand how mappers and reducers work, my problem (and question) is:
I have a long mapper that before generating [key,value] pairs needs to generate some intermediate files. For example, in a whole mapper: files A and B as inputs, I need an output file E with some intermediate files that can't be reduced.
file A -> file C
file B -> file D
file C + file D -> file E
Is it possible to archieve this? Do intermediate files stay in the nodes?
Upvotes: 0
Views: 1082
Reputation: 1810
This cannot be achieved conventionally. But 2 approaches are possible:
Upvotes: 1