Praveen Kumar B N
Praveen Kumar B N

Reputation: 141

Can Hive Work on any storage system or does it need HDFS itself?

Hive is always coupled with HDFS. My question is, can Hive run on any other location apart from HDFS? Just like spark, does it support any data storage?

Upvotes: 0

Views: 322

Answers (1)

Oli
Oli

Reputation: 10406

Hive was built for hadoop, and therefore for HDFS. Here is what the official documentation says about it

What Is Hive

Hive is a data warehousing infrastructure based on Apache Hadoop. Hadoop provides massive scale out and fault tolerance capabilities for data storage and processing on commodity hardware.

Yet, according to wikipedia, Hive can work on "compatible" file systems:

Apache Hive supports analysis of large datasets stored in Hadoop's HDFS and compatible file systems such as Amazon S3 filesystem and Alluxio.

AFAIK, there is no documentation about this in Hive's official documentation but you can find plenty on AWS and Alluxio's websites.

Upvotes: 1

Related Questions