Reputation: 595
I am using fastparquet to convert pandas DataFrames to Parquet files. It is much faster than my previous approach, which used PySpark.
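For reference, the write step looks roughly like this (a minimal sketch; the dataframe contents are just placeholders):

    import pandas as pd
    import fastparquet

    # example dataframe standing in for my real data
    df = pd.DataFrame({'a': [1, 2, 3], 'b': ['x', 'y', 'z']})

    # write the dataframe to a parquet file with fastparquet
    fastparquet.write('/tmp/parquet/test.parquet', df)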
I then want to read these Parquet files with Spark, i.e.
sqlCtx.read.parquet('/tmp/parquet/test.parquet')
I had a few issues, which I managed to resolve. The issue I have now is with RLE encoding. I get the following Java exception when I try to read the Parquet file with PySpark:
Unsupported encoding: RLE
Is there a way to disable RLE when using the fastparquet write method?
Upvotes: 1
Views: 626
Reputation: 28683
This is an optimization within fastparquet for short integers ('int8', 'int16', 'uint8', 'uint16'). Unfortunately, Spark does not support the full Parquet spec.
If you want your data to be readable by Spark, you should first convert the short integer columns to 32 or 64 bits.
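A minimal sketch of that conversion, assuming a pandas dataframe df and the file path from the question (both placeholders for your own data):

    import fastparquet

    # upcast any 8/16-bit integer columns to 32 bits so fastparquet
    # does not choose the RLE encoding that Spark cannot read
    small_ints = df.select_dtypes(include=['int8', 'int16', 'uint8', 'uint16']).columns
    df[small_ints] = df[small_ints].astype('int32')

    fastparquet.write('/tmp/parquet/test.parquet', df)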
There has been some consideration of implementing a "compatibility mode" in which these problems go away at the cost of performance, but there are no concrete plans for that right now.
Upvotes: 1