Running Hive Queries in Python?

Question

I've been stuck on this for a few hours and am having trouble making any progress. I have a remote Hadoop instance with Hue server I've been running Hive queries against. These work fine. I've been hoping to run the queries against hive directly through Python now, but this is where my problems arise. I've tried running things through both Python Hive Utils and pyhs2. The former gives me:

thrift.Thrift.TApplicationException: Invalid method name: 'get_database'

The latter just times out.

I know the server is using 0.10.0-cdh4.3.0, but I don't know how to tell if it's using HiveServer or HiveServer2.

So, my question is three-fold:

How do I know which version of HiveServer is being used?
Does that matter in terms of connecting through Python?
Why can't I connect?

Running Hive Queries in Python?

Answers (1)

Related Questions