Naresh
Naresh

Reputation: 5397

How to install cloudera impala on EMR?

Is there anyway i can install the only impala without cloudera manager and without cdh. I will be using the apache version of hadoop?

Upvotes: 1

Views: 2981

Answers (3)

user3098246
user3098246

Reputation: 11

You can view detailed instructions on how to install and use Impala with Amazon EMR here: http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-impala.html

Upvotes: 1

rudygodoy
rudygodoy

Reputation: 478

EMR is based on a Amazon Hadoop distribution that runs on top of Debian squeeze. So, yes it's possible using Cloudera's DEB repo.

You will need to SSH to your EMR master node, find the address on EMR console.

You will also need to enable security rules on the security group you have assigned to your EMR cluster, if you intend to connect to Impala using a JDBC/ODBC client form the outside world.

Upvotes: -1

Tariq
Tariq

Reputation: 34184

Yes, it is absolutely possible. Add the repository into your sources.list file and update the repository after that.

deb [arch=amd64] http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib deb-src http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib

After that, it's merely :

sudo apt-get install impala                (Binaries for daemons)
sudo apt-get install impala-server         (Service start/stop script)
sudo apt-get install impala-state-store    (Service start/stop script)

But do not forget to meet all the prerequisites. For a detailed info you can go here

Upvotes: 1

Related Questions