Reputation: 141
Hive is always coupled with HDFS. My question is, can Hive run on any other location apart from HDFS? Just like spark, does it support any data storage?
Upvotes: 0
Views: 322
Reputation: 10406
Hive was built for hadoop, and therefore for HDFS. Here is what the official documentation says about it
What Is Hive
Hive is a data warehousing infrastructure based on Apache Hadoop. Hadoop provides massive scale out and fault tolerance capabilities for data storage and processing on commodity hardware.
Yet, according to wikipedia, Hive can work on "compatible" file systems:
Apache Hive supports analysis of large datasets stored in Hadoop's HDFS and compatible file systems such as Amazon S3 filesystem and Alluxio.
AFAIK, there is no documentation about this in Hive's official documentation but you can find plenty on AWS and Alluxio's websites.
Upvotes: 1