HBase with MapReduce option

Question

I am new to Hadoop and I recently installed Hive and HBase.

I created few tables in Hive and the queries are running in MapReduce fashion. Also, when I say 'get' in HBase, it is not running in MapReduce and that is why I believe it is of high latency.

I have few quick doubts here.

If I have an application where I need to process real time streaming data, what shall I use - Hive or HBase?
Can I install HBase with MapReduce option, such that the get command in HBase runs in MapReduce fashion

Thanks in advance

dino.keco · Accepted Answer

HBase is database which and it doesn't have option to run mapreduce for its operations like get, scan, put ...

If you want to process data from HBase in mapreduce style You need to create custom map reduce job, or use some other analytics tool like Hive, Pig, ...

Hive is data warehouse platform built on top of Hadoop mapreduce. It can read data from many different sources like, HDFS file, S3 file, HBase, etc ...

Hope this is useful for You.

HBase with MapReduce option

Answers (2)

Related Questions