Reputation: 760
This may be very basic question but I would like to know that for using queries in Hive and HBase, do we need a running Hadoop cluster?
I would like to use Hive or HBase with AWS S3 storage to store my data but wondering if I would need my cluster running all the time to be able to run queries as this would incur me cost?
Upvotes: 0
Views: 97
Reputation: 13430
Hive no, HBase yes, because it needs "a real filesystem". AWS EMR have done "things" to their version of HBase to work around this, but they've not shared that work with the open source code, so you only get it with EMR
Upvotes: 3