Reputation: 39
I have a large number of CSV files (~12k), each of which is small (~250 records). I want to load them into a 3-node Redshift cluster in the same region, but it's taking a really long time.
The query I used in SQL Workbench/J is:
copy gsod from 's3://[path to folder]' access_key_id '******' secret_access_key '******' delimiter ',' BLANKSASNULL emptyasnull IGNOREHEADER 1 maxerror as 100000;
The query runs in seconds if I use a single file, but what's the best way to load all of them as quickly as possible?
I have already tried loading the files from an S3 bucket in the same region as the cluster, using the same COPY command as above.
Upvotes: 0
Views: 1597
Reputation: 5729
Go for the manifest file option. It will be much faster.
https://docs.aws.amazon.com/redshift/latest/dg/loading-data-files-using-manifest.html
{
  "entries": [
    {"url":"s3://mybucket-alpha/2013-10-04-custdata", "mandatory":true},
    {"url":"s3://mybucket-alpha/2013-10-05-custdata", "mandatory":true},
    {"url":"s3://mybucket-beta/2013-10-04-custdata", "mandatory":true},
    {"url":"s3://mybucket-beta/2013-10-05-custdata", "mandatory":true}
  ]
}
The copy runs in parallel: instead of processing the files one by one, Redshift loads all of the listed files in a single attempt.
copy customer from 's3://mybucket/your-manifest-file' iam_role 'arn:aws:iam::0123456789012:role/MyRedshiftRole' manifest;
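With ~12k files you won't want to write the manifest by hand. Here is a minimal sketch of generating and uploading it with Python and boto3; the bucket name "my-bucket", the prefix "gsod/", and the manifest key are placeholders you would replace with your own paths:

import json
import boto3

# List every object under the prefix and build one manifest entry per CSV file.
# "my-bucket" and "gsod/" are placeholders for your own bucket and folder.
s3 = boto3.client("s3")
paginator = s3.get_paginator("list_objects_v2")

entries = []
for page in paginator.paginate(Bucket="my-bucket", Prefix="gsod/"):
    for obj in page.get("Contents", []):
        if obj["Key"].endswith(".csv"):
            entries.append({"url": f"s3://my-bucket/{obj['Key']}", "mandatory": True})

# Write the manifest locally, then upload it so COPY can read it from S3.
with open("gsod.manifest", "w") as f:
    json.dump({"entries": entries}, f, indent=2)

s3.upload_file("gsod.manifest", "my-bucket", "gsod.manifest")

Then point COPY at the uploaded manifest as in the example above. The manifest option works with your existing access_key_id/secret_access_key credentials as well as with iam_role.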
Hope this helps.
Upvotes: 2