Kehan Chen

Reputation: 11

How to read a 30 GB Parquet file in Python

I am trying to read data from a large Parquet file (30 GB). I do not have enough memory to load it with fastparquet's default read in Python. How can I lower the memory usage of the reading process?

Upvotes: 1

Views: 1093

Answers (1)

Micah Kornfield

Reputation: 1698

You can use pyarrow's ParquetFile.iter_batches to read the file back incrementally, one chunk of rows at a time, instead of loading the whole file into memory.

Upvotes: 1
