Trishit Ghosh

Reputation: 255

Using compressed files with Datafusion

Is there a way to use compressed files with Cloud Data Fusion? I have used Google Cloud Storage as a source and placed a gzip file in the preferred location.

In the Wrangler transform, I don't see a preview. When I try to select the file using select Data, the gzipped file is not highlighted. The same steps work fine with an uncompressed file.

Should I be using some transform before I wrangle? Is there a way to read a compressed file directly and preview the data? In Dataprep, the transform identifies files based on their extension, but in Data Fusion there seems to be no such option.

I was using the Basic edition of the Data Fusion environment. Would the Enterprise edition help?

Upvotes: 3

Views: 1072

Answers (2)

Trishit Ghosh

Reputation: 255

Although Wrangler doesn't let us select a compressed file and build transforms on it interactively, we can still enter the Wrangler directives manually. The pipeline then works as expected when we supply a compressed file to the source at run time.
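One possible way to avoid typing the directives blind, offered as an assumption rather than something tested in Data Fusion: decompress a small sample of the gzip file locally, upload that sample next to the original, build the directives against it in Wrangler, and then point the pipeline source at the compressed file. A minimal Python sketch of the sampling step follows; the function name, file paths, and the 1000-line default are all hypothetical.

```python
import gzip
from itertools import islice


def write_uncompressed_sample(gz_path, sample_path, max_lines=1000):
    """Copy the first max_lines lines of a gzip text file into a plain text file.

    Hypothetical helper: the idea is to give Wrangler an uncompressed sample
    to preview, while the real pipeline still reads the compressed file.
    Returns the number of lines written.
    """
    written = 0
    # "rt" opens the gzip stream in text mode, so we can iterate line by line
    # without loading the whole file into memory.
    with gzip.open(gz_path, "rt", encoding="utf-8") as src, \
            open(sample_path, "w", encoding="utf-8") as dst:
        for line in islice(src, max_lines):
            dst.write(line)
            written += 1
    return written
```

For example, `write_uncompressed_sample("data.csv.gz", "data_sample.csv", max_lines=500)` would write at most 500 lines to `data_sample.csv`, assuming the input is UTF-8 text.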

Upvotes: 0

Sree

Reputation: 714

Wrangler expects files to be uncompressed and does not yet support reading compressed files. I have opened an enhancement request for this: https://issues.cask.co/browse/CDAP-16140

Thanks, Sree

Upvotes: 1
