Azure Data Factory, cant use Excel Dataset with Wildcard

Question

My Problem: I have a Data Lake Gen 2 Storage Account with an import directory which contains .xlsx files.

Now im trying to create a Dataset pointing to this directory. The Dir will contain multiple .xlsx files and also an archive directory, where processed .xlsx files will be moved to.

I want to point the data set specifically into the import folder and not into the import/archive folder - from what I've read i should use a wildcard like *.xlsx in the import dir.

However, I cannot get the dataset to work with the Wildcard, when I point it directly to the FileName.xlsx file its no problem:

working dataset pointing directly to one file

not working using wildcard

what am I doing wrong?

I tried to write the Sheet name manually and also tried to use the sheet index 0 manually, both give me an error:

ADLS Gen2 operation failed for: Operation returned an invalid status code 'NotFound'. Account: '********'. FileSystem: ''. Path: 'ingress/industrysectors/import/*.xlsx'. ErrorCode: 'PathNotFound'. Message: 'The specified path does not exist.'.

when removing the wildcard '*.xlsx' the file gets found, however that also means that in the future the .xlsx files in the import/archive folder will also be considered in the data set.

Azure Data Factory, cant use Excel Dataset with Wildcard

Answers (1)

Related Questions