user2732748
user2732748

Reputation: 97

Cassandra integration with Hadoop

I am newbie to Cassandra. I am posting this question as different documentations were providing different details with respect to integeting Hive with Cassandra and I was not able to find the github page.

I have installed a single node Cassandra 2.0.2 (Datastax Community Edition) in one of the data nodes of my 3 node HDP 2.0 cluster.

I am unable to use hive to access Cassandra using 'org.apache.hadoop.hive.cassandra.cql3.CqlStorageHandler'. I am getting the error ' return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask. org.apache.hadoop.hive.ql.metadata.HiveException: Error in loading storage handler.org.apache.hadoop.hive.cassandra.cql3.CqlStorageHandler'

I have copied all the jars in /$cassandra_home/lib/* to /$hive-home/lib and also included the /cassandra_home/lib/* in the $HADOOP_CLASSPATH.

Is there any other configuration changes that I have to make to integrate Cassandra with Hadoop/Hive?

Please let me know. Thanks for the help!

Thanks, Arun

Upvotes: 1

Views: 3355

Answers (1)

Roman Nikitchenko
Roman Nikitchenko

Reputation: 13036

Probably these are starting points for you:

Actually your question is not so narrow, there could be lot of reasons for this. But what you should remember Hive is based on MapReduce engine.

Hope this helps.

Upvotes: 1

Related Questions