Reputation: 2284
I'm wondering to what extent Hadoop programming can be done using PHP or Ruby. I found articles talking about the Hadoop Streaming API, which can be used from PHP or Ruby.
My questions:
Can you write a map-reduce job in PHP or Ruby that works alongside other Hadoop map-reduce jobs written in Java?
In terms of API-level programming, what is missing for non-Java languages in Hadoop? That is, what can currently be done only in Java and not in other languages?
Thanks
Upvotes: 0
Views: 618
Reputation: 8088
In a nutshell: Hadoop has a number of other plugin points besides mappers and reducers, such as combiners, input/output formats, and comparators. These plugins can be written in Java only.
So using Hadoop via streaming can suit some simple cases, but it will seriously reduce your flexibility.
Streaming is also somewhat slower, because records are passed to the mappers and reducers as text over the external process's standard input and output rather than through in-process Java calls.
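For the simple cases where streaming does fit, a mapper or reducer is just a program that reads lines from standard input and writes tab-separated key/value lines to standard output. Here is a minimal word-count sketch in Python (the same shape works in Ruby or PHP). The function names and the `map`/`reduce` command-line convention are my own illustrative assumptions, not part of Hadoop:

```python
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record for every word in the input."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer: sum the counts for each key.

    Assumes the input arrives sorted by key, which is what Hadoop
    provides between the map and reduce phases.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


# Hypothetical convention: run as "wordcount.py map" or "wordcount.py reduce".
if __name__ == "__main__" and sys.argv[1:2] in (["map"], ["reduce"]):
    step = map_lines if sys.argv[1] == "map" else reduce_lines
    for record in step(sys.stdin):
        print(record)
```

Assuming the script is saved under a name like wordcount.py, it would be handed to the streaming jar through its -mapper and -reducer options (for example -mapper "wordcount.py map"), together with -input and -output paths. The exact jar location depends on your installation. You can also test it locally without Hadoop by piping a file through the map step, then sort, then the reduce step.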
Upvotes: 2
Reputation: 28971
Personally, I also come from a Ruby/Python background, and at first tried using streaming to get things done. Eventually I decided to give the Java API a chance, and it turned out to be not too bad :)
Upvotes: 1