StackUnderflow

Reputation: 25368

Column store on top of Hadoop?

Is there a column store similar to Vertica that is built on top of Hadoop? I am not talking about HBase, as it is a sparse matrix store and cannot reach the level of compression that something like Vertica achieves.

Thanks

Upvotes: 0

Views: 880

Answers (4)

Antonio

Reputation: 774

Take a look at Hadapt: http://hadapt.com/

This is a commercial version of HadoopDB (http://db.cs.yale.edu/hadoopdb/hadoopdb.html), which was developed at Yale University. It can work with a column-oriented DB installed on every node of a cluster, while leveraging Hadoop for fault-tolerant execution.

Upvotes: 0

Sandeep

Reputation: 11

RCFile is a good start. RCFile stores data in a PAX layout -- columnar within blocks that could be as large as HDFS's block size. There is a paper at VLDB 2011 describing another columnar storage format here and a blog post with a short comparison to RCFile here.
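
To make the PAX idea concrete, here is a toy sketch in Python. It is not RCFile's actual on-disk format or API. The names `to_row_groups`, `read_column` and `compressed_size` are made up for illustration, and zlib stands in for whatever codec a real format would use. Rows are first split into groups, and each group is then stored column by column.

```python
import zlib
from typing import List, Sequence, Tuple

Row = Tuple[str, ...]
RowGroup = List[List[str]]  # one list of values per column


def to_row_groups(rows: Sequence[Row], group_size: int) -> List[RowGroup]:
    """Split rows into groups, then lay out each group column by column."""
    groups: List[RowGroup] = []
    for start in range(0, len(rows), group_size):
        chunk = rows[start:start + group_size]
        # zip(*chunk) transposes the chunk: rows become columns.
        groups.append([list(col) for col in zip(*chunk)])
    return groups


def read_column(groups: Sequence[RowGroup], col_index: int) -> List[str]:
    """Collect one column, touching only that column's slice in each group."""
    values: List[str] = []
    for group in groups:
        values.extend(group[col_index])
    return values


def compressed_size(values: Sequence[str]) -> int:
    """Size in bytes of the zlib-compressed, newline-joined values."""
    return len(zlib.compress("\n".join(values).encode("utf-8")))


if __name__ == "__main__":
    # Hypothetical sample data: (id, country, status)
    rows = [(str(i), "us" if i % 2 else "de", "active") for i in range(1000)]
    groups = to_row_groups(rows, group_size=250)
    countries = read_column(groups, 1)
    print(len(groups), len(countries), compressed_size(countries))
```

Because values from the same column sit next to each other inside a group, a low-cardinality column such as the country field above tends to compress well, and a query that needs one column can skip the others within each group.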

Upvotes: 1

geoffrobinson

Reputation: 1580

I haven't worked with Hadoop, but I know Vertica has been trying to integrate with Hadoop.

http://www.vertica.com/the-analytics-platform/native-bi-etl-and-hadoop-mapreduce-integration/

Upvotes: 0

Steve Severance

Reputation: 6646

Are you looking for something like RCFile? It is a file format that stores data in a columnar layout internally.

Upvotes: 3
