kannadhasan
kannadhasan

Reputation: 339

How to load data from Cassandra to HDFS?

I have a data which is resides in Apache Cassandra,I want to perform map/reduce jobs using hadoop Ecosystem tools.

How to load data from Cassandra to HDFS?

Is there any storage handler other than Cassandra storage handler/brisk ?

Upvotes: 3

Views: 6240

Answers (3)

oneself
oneself

Reputation: 40231

There's a new tool from Netflix called Aegisthus which tries to address this.

A Bulk Data Pipeline out of Cassandra. Aegisthus implements a reader for the SSTable format and provides a map/reduce program to create a compacted snapshot of the data contained in a column family.

Upvotes: 2

Hussain Pirosha
Hussain Pirosha

Reputation: 1376

Consider looking at this project pygmalion. This shall allow you to export data from Cassandra into HDFS and then use Pig latin for querying.

Upvotes: 1

Tariq
Tariq

Reputation: 34184

You could do that in multiple ways. Write a MapReduce job to read data from Cassandra and write it to HDFS or write a PigLatin script to do that. See this for more info.

Upvotes: 0

Related Questions