Reputation: 123
Just trying to wrap my head around this as I was doing a lot of reading and there's a lot of confusing marketing information out there.
So my company lives in a world of relational databases including Oracle, MS Access and MS SQL Server.
We were wanting to consolidate data sources into a data lake, in particular Azure Data Lake which is marketed as being able to store any sort of data, but upon further reading it seems that it can only store data in a few file formats including parquet, CSV etc.
So my question is - If we have a bunch of relational databases, do we need to
Did I get this right? For some reason I naively thought you could just grab the data files and drop it into the data lake. I just wanted to make sure that there is a step needed to "convert" the source table(s) and output it into a more common file format (csv, parquet etc).
Upvotes: 1
Views: 413
Reputation: 3230
I just wanted to make sure that there is a step needed to "convert" the source table(s) and output it into a more common file format (csv, parquet etc).
Azure Data Factory supports only below formats as Source.
You can connect select tables from Oracle, MS Access and SQL Server as Source. But, you can not select .dbf, .accdb and .mdf file formats as source in ADF.
ADLS supported file formats are given in this official documentation
Refer - https://learn.microsoft.com/en-us/azure/data-factory/supported-file-formats-and-compression-codecs
Upvotes: 1