Why use Hadoop why we have Spark?

Question

Can anyone please provide some guidance as why should we use Hadoop while now Spark is available? As we all know Spark was created in the first place to solve the limitations of Hadoop?

Thank you.

Ghislain Fourny · Accepted Answer

Hadoop has several components, including a distributed file system, HDFS, a parallel data processing framework, MapReduce, and a wide column store, HBase.

While Spark can be seen as a next-generation version of MapReduce with generalized dataflows (DAGs), Spark does not replace HDFS or HBase. Rather, it can consume data from HDFS and HBase as input, and write data back to them.

I hope this helps!

Why use Hadoop why we have Spark?

Answers (2)

Related Questions