Vitali Dedkov

Reputation: 318

Unzip gzip files in Azure Data Factory

I am wondering whether it is possible to set up a source and sink in ADF that will unzip a gzip file and show the extracted txt file. The sink was defined incorrectly: both the source and the sink had gzip compression.

As a result, "file1.gz" is now "file1.gz.gz".

This is how the file looks in Azure Blob Storage: (screenshot)

This is how the file looks in the S3 bucket (the end of the name is cut off, but it ends in "txt.gz"): (screenshot)

I saw that the Copy activity offers ZipDeflate and Deflate compression, but I get an error saying it does not support this type of activity.

I created a sink in an ADF pipeline where I am trying to unzip the file. On the data source screen I used ZipDeflate, but it gives the output file a "deflate" extension instead of "txt".

Thank you

Upvotes: 1

Views: 6759

Answers (1)

ArunasB

Reputation: 124

Create a "Copy data" activity.

Source: since your extension is gz, choose GZip as the compression type and tick "Binary copy".

Target: Blob Storage, binary, with compression type None.

Such a copy pipeline will unzip your text file(s).
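For the double-compressed case in the question ("file1.gz.gz"), a single decompression pass removes only one gzip layer, so the copy would presumably have to be run twice. It can help to check locally how many layers a downloaded file actually has. The following is a minimal Python sketch using only the standard library, not ADF code; the helper names `peel_gzip_layers` and `strip_gz_suffixes` are hypothetical and exist only for this illustration. It assumes the real payload is plain text that does not itself start with the gzip magic bytes.

```python
import gzip
from typing import Tuple

GZIP_MAGIC = b"\x1f\x8b"  # every gzip stream starts with these two bytes


def peel_gzip_layers(data: bytes, max_layers: int = 5) -> Tuple[bytes, int]:
    """Decompress repeatedly while the payload still looks like gzip.

    Returns the innermost payload and the number of layers removed.
    max_layers is an arbitrary safety cap chosen for this sketch.
    """
    layers = 0
    while data[:2] == GZIP_MAGIC and layers < max_layers:
        data = gzip.decompress(data)
        layers += 1
    return data, layers


def strip_gz_suffixes(name: str, layers: int) -> str:
    """Drop one trailing '.gz' from the file name per removed layer."""
    for _ in range(layers):
        if name.endswith(".gz"):
            name = name[: -len(".gz")]
    return name


if __name__ == "__main__":
    # Simulate the situation from the question: a text file gzipped twice.
    original = b"some text content\n"
    double_compressed = gzip.compress(gzip.compress(original))

    payload, layers = peel_gzip_layers(double_compressed)
    print(layers, strip_gz_suffixes("file1.txt.gz.gz", layers))
```

If the reported layer count is 2, the file was compressed twice and one GZip-to-None copy will still leave a gzip file behind.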

Upvotes: 2
