Reputation: 339
I have data that resides in Apache Cassandra, and I want to run map/reduce jobs on it using Hadoop ecosystem tools.
How do I load data from Cassandra into HDFS?
Is there any storage handler other than the Cassandra storage handler / Brisk?
Upvotes: 3
Views: 6240
Reputation: 40231
There's a new tool from Netflix called Aegisthus that tries to address this. From the project description:
"A Bulk Data Pipeline out of Cassandra. Aegisthus implements a reader for the SSTable format and provides a map/reduce program to create a compacted snapshot of the data contained in a column family."
Upvotes: 2
Reputation: 1376
Consider looking at the pygmalion project. It should let you export data from Cassandra into HDFS and then query it with Pig Latin.
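For a small column family, the export step can also be done by hand. This is only a rough sketch in Python, not how pygmalion works: it assumes the rows were already fetched from Cassandra as dicts with whatever client you use, and that the `hdfs` command is on your PATH. The names `rows_to_tsv`, `put_to_hdfs` and the sample rows are made up for illustration.

```python
import csv
import io
import subprocess


def rows_to_tsv(rows, columns):
    """Render an iterable of dict rows as tab-separated text, one row per line."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    for row in rows:
        # Cassandra rows can be sparse, so missing columns become empty fields.
        writer.writerow([row.get(col, "") for col in columns])
    return buf.getvalue()


def put_to_hdfs(text, hdfs_path):
    """Stream text into HDFS; `hdfs dfs -put - <path>` reads from stdin."""
    subprocess.run(
        ["hdfs", "dfs", "-put", "-", hdfs_path],
        input=text.encode("utf-8"),
        check=True,
    )


# Hypothetical rows standing in for the result of a Cassandra query.
sample_rows = [
    {"user_id": "u1", "name": "alice", "age": 30},
    {"user_id": "u2", "name": "bob"},
]
tsv = rows_to_tsv(sample_rows, ["user_id", "name", "age"])
# put_to_hdfs(tsv, "/data/users.tsv")  # hypothetical path, needs a Hadoop client
```

The output is tab-delimited, which is the default delimiter for Pig's `PigStorage` loader, so the file can be read from Pig without extra options. For anything large, a map/reduce based export is the better fit.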
Upvotes: 1