How to read parquet file from s3 bucket in nifi?

Question

I am trying to read parquet file from s3 bucket in nifi. to read the file I have used processor listS3 and fetchS3Object and then ExtractAttribute processor. till there it looked fine.

the files are in parquet.gz file and by no mean i was able to generate the flowfile from them, My final purpose is to load the file in noSql(SnowFlake).

FetchParquet works with HDFS which we are not used.

My next option is to use executeScript processor (with python) to read these parquet file and save them back to text.

Can somebody please suggest any work around.

How to read parquet file from s3 bucket in nifi?

Answers (1)

Related Questions