Reputation: 1
Azure synapse dedicated SQL pools exports to ADLS storage account via polybase have .parq extensions instead of .parquet extensions.
CREATE EXTERNAL DATA SOURCE [SomeExternalDataSourcename] WITH(TYPE=HADOOP, LOCATION=N'abfss://[email protected]/foldername') ;
CREATE EXTERNAL FILE FORMAT [ff_Parquet] WITH (FORMAT_TYPE = PARQUET)
CREATE EXTERNAL TABLE [staging_schema].[table] WITH(LOCATION='folder/schema_table', DATA_SOURCE=[SomeExternalDataSourcename], FILE_FORMAT=[ff_Parquet]) AS SELECT * FROM [schema].[table];
The result is below
we were expecting .parquet extension. Is there anyway to generate exports directly with .parquet extensions ?
Upvotes: 0
Views: 224
Reputation: 7156
Dedicated SQL pool creates .parq
file only when using externat tables to create a file. I tried and got the .parq
extension file only. When searched for similar issue, got this in Microsoft QnA platform . Unfortunately, the file extension cannot be changed to the standard ".parquet" extension in the ADLS gen2 using dedicated SQL pool. Workaround is to rename the files only. You can use ADF pipeline to rename the file.
get metadata activity
..parq
as extension in the file name from the get metadata activity output. In the sink dataset , split the source data filename with the dot(.)
symbol and give the <filename>.parquet
as the filename.You can log your feedback about this in Microsoft azure feedback platform.
Upvotes: 0