Reputation: 1036
I'm trying to read a large parquet file using the pyarrow's stream reader as described here.
But I get the following error:
module 'pyarrow' has no attribute 'StreamReader'
I've tried switching to RecordBatchStreamReader but it seems its not made for reading Parquet files directly.
I'm using pyarrow 0.14.
Upvotes: 1
Views: 825
Reputation: 105521
RecordBatchStreamReader
is for reading the Arrow binary protocol, not for Parquet files. Try using pyarrow.parquet.read_table
http://arrow.apache.org/docs/python/parquet.html
Upvotes: 1