Igor Semenko
Igor Semenko

Reputation: 459

Does Presto support Parquet format?

Running CDH4 cluster with Impala, I created parquet table and after adding parquet jar files to hive, I can query the table using hive.

Added same set of jars to /opt/presto/lib and restarted coordinator and workers.

parquet-avro-1.2.4.jar
parquet-cascading-1.2.4.jar
parquet-column-1.2.4.jar
parquet-common-1.2.4.jar
parquet-encoding-1.2.4.jar
parquet-format-1.0.0.jar
parquet-generator-1.2.4.jar
parquet-hadoop-1.2.4.jar
parquet-hive-1.2.4.jar
parquet-pig-1.2.4.jar
parquet-scrooge-1.2.4.jar
parquet-test-hadoop2-1.2.4.jar
parquet-thrift-1.2.4.jar

Still getting this error when running parquet select query from Presto:

> select * from test_pq limit 2;
Query 20131116_144258_00002_d3sbt failed : org/apache/hadoop/hive/serde2/SerDe 

Upvotes: 3

Views: 5279

Answers (2)

David Phillips
David Phillips

Reputation: 10218

Presto now supports Parquet automatically.

Upvotes: 2

Damien Carol
Damien Carol

Reputation: 1

Try to add the jars in presto plugin dir instead of presto lib dir.

Presto auto loads jars from plugins dirs.

Upvotes: 0

Related Questions