Reputation: 1725
Would you have any hints on what would be the best way to deal with files containing JSON entries and Hadoop?
Upvotes: 5
Views: 22175
Reputation: 1232
Twitter's elephant-bird library has a JsonStringToMap class which you can use with Pig.
Upvotes: 3
Reputation: 318
MongoDB is a good option when you are dealing with JSON. MongoDB and Hadoop are a powerful combination and can be used together to deliver complex analytics and data processing for data stored in MongoDB. http://www.mongodb.org/
Upvotes: 0
Reputation: 6169
Try this
You can also use JAQL. Its the easiest way to deal with JSON in Map Reduce. Bad thing is that you will have to learn JAQL (unless you know it already) !!
Upvotes: 1
Reputation: 30089
There's a nice article on this from the Hadoop in Practice book:
Upvotes: 4