Daniel Haviv
Daniel Haviv

Reputation: 1036

stream reading with pyarrow results in module 'pyarrow' has no attribute 'StreamReader'

I'm trying to read a large parquet file using the pyarrow's stream reader as described here.
But I get the following error:

module 'pyarrow' has no attribute 'StreamReader'

I've tried switching to RecordBatchStreamReader but it seems its not made for reading Parquet files directly.

I'm using pyarrow 0.14.

Upvotes: 1

Views: 825

Answers (1)

Wes McKinney
Wes McKinney

Reputation: 105521

RecordBatchStreamReader is for reading the Arrow binary protocol, not for Parquet files. Try using pyarrow.parquet.read_table

http://arrow.apache.org/docs/python/parquet.html

Upvotes: 1

Related Questions