Reputation: 497
Given a parquet file how can I create the table associated with it into my redshift database? Oh the format of the parquet file is snappy.
Upvotes: 0
Views: 1943
Reputation: 4486
If you're dealing with multiple files, especially over a long term, then I think the best solution is to upload them to an S3 bucket and run a Glue crawler.
In addition to populating the Glue data catalog, you can also use this information to configure external tables for Redshift Spectrum, and create your on-cluster tables using create table as select
.
If this is just a one-off task, then I've used parquet-tools
in the past. The version that I've used is a Java library, but I see that there's also a version on PyPi.
Upvotes: 3