Reputation: 127
I am trying to read and excel file in Azure Blob Storage with .xlsx extension in my azure data factory dataset. it throws following error
Error found when processing 'Csv/Tsv Format Text' source 'Filename.xlsx' with row number 3: found more columns than expected column count: 1.
What are the right Column and row delimiters for excel files to be read in azure Data factory
Upvotes: 7
Views: 19670
Reputation: 14389
Update March 2022: ADF now has better support for Excel via Mapping Data Flows:
https://learn.microsoft.com/en-us/azure/data-factory/format-excel
Excel files have a proprietary format and are not simple delimited files. As indicated here, Azure Data Factory does not have a direct option to import Excel files, eg you cannot create a Linked Service to an Excel file and read it easily. Your options are:
Let us know how you get on.
Upvotes: 8
Reputation: 11361
Azure Data Factory V2 has recently released an update to support parsing Excel(.xls) files on existing connectors.
Currently, the connections supporting excel files are:
More details can be found here: https://learn.microsoft.com/en-us/azure/data-factory/format-excel
Upvotes: 1
Reputation: 1
I would like to elaborate a bit more on option 3. SSIS packages. I tested the solution and it sorta worked. First of all - SSIS IR is super expensive and You would like to use http post's to start-stop it while it is not needed.
Secondly, SSIS does not support processing blob files out of the box. There are some ways around it. You can use some third-party soft (didn't try that), or explore options
In my case I used trick to deploy package that download file locally, process and then delete it. Since SSIS IR is a VM, it basically did the same on IR after deployment. It was a trick rather than solution and it was not 100% stable. Another issue - numeric fields got some insignificant error during processing - eg. uploading 25 resulted in 25.0000000001 at db site. No idea why. I did not have enough time to investigate issues deeper.
Upvotes: 0
Reputation: 11
Azure Data Factory does not have a direct option to upload the excel, however, it can be uploaded through linked service via some configurations. Follow below steps to do so:
Upvotes: 0