Reputation:
I am trying to install HBase v0.98.6 on ubuntu. I already have Hadoop YARN running on the OS. Should I stop the existing Hadoop processes and rely solely on HBase or configure HBase with existing YARN setup? I can share more information if needed. I am trying to run HBase first on pseudo-distributed and then in distributed mode.
To be clear - I am asking whether I need Hadoop YARN running before I install HBase (in a distributed manner - not on a single computer). If not, and I still have Hadoop YARN on those computers, will it cause any issues for HBase to run on those servers.
Upvotes: 1
Views: 1933
Reputation: 1810
This question is a bit confusing. But the point is HBase & YARN are not dependent on each other.
You can very well stop the YARN service and use HBase. The only services HBase will use from your existing cluster are HDFS & Zookeeper.
Some places people use MapReduce to aggregate data for HBase tables. And in that case both can coexist.
If your use case is real time data access/update with high throughput then i would recommend not running YARN along with HBase.
Hope this was what you were looking for :)
Upvotes: 1