SQLAstro

Reputation: 25

Issue with custom container for Delta Table in Azure Synapse

I currently have PySpark code that copies parquet files from one container to another and, at the same time, creates a Delta table in the destination container. Both containers are in the same ADLS Gen2 storage account. Even though I have set the destination container, when I run the notebook Azure Synapse still creates the folder in the default location.

from pyspark.sql import SparkSession

# Initialize Spark session
spark = SparkSession.builder.appName("DeltaTableCreation").config("spark.jars.packages", "io.delta:delta-core_2.12:1.0.0").getOrCreate()

# Specify your Data Lake Storage account and containers
data_lake_account = "youraccount"
source_container = "source-container"
destination_container = "your-custom-destination-container"

# Specify paths for source and destination containers
source_container_path = f"abfss://{source_container}@{data_lake_account}.dfs.core.windows.net/path/to/source"
destination_container_path = f"abfss://{destination_container}@{data_lake_account}.dfs.core.windows.net/path/to/destination"

# Function to recursively discover Parquet files in nested folders
def discover_parquet_files(base_path):
    return spark.read.format("parquet").option("recursiveFileLookup", "true").load(base_path)

# Read data from multiple Parquet files in the source container
source_df = discover_parquet_files(source_container_path)

# Save the data as a Delta table at the explicitly specified destination path
source_df.write.format("delta").mode("overwrite").save(destination_container_path)

# Stop the Spark session
spark.stop()

Upvotes: 0

Views: 79

Answers (1)

Rakesh Govindula

Reputation: 11514

"Azure Synapse still creates the folder in the default location."

That is expected behavior; this is how Delta tables are created in ADLS Gen2.

A Delta table is created as a folder of parquet files plus a _delta_log directory that holds the transaction log. If your delta path is abfss://<container>@<storage_account_name>.dfs.core.windows.net/<delta_table_name>, then the last folder in this path is your delta table.

Check the example below.

destination_container_path = "abfss://targetdata@<storage_account_name>.dfs.core.windows.net/mydelta"
df.write.format("delta").mode("overwrite").save(destination_container_path)

Here, a delta table named mydelta will be created in the targetdata container. This folder will contain part parquet files; the number of files depends on the size of the data.
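If you want to confirm this without opening the Storage browser, you can list the folder from the notebook. This is a minimal sketch, assuming you are running in a Synapse notebook where the built-in mssparkutils helper is available; the path is the placeholder one from the example above.

from notebookutils import mssparkutils  # built into Azure Synapse notebooks

# List the delta folder: expect a _delta_log directory alongside
# part-*.parquet data files (replace the placeholder with your account)
for file_info in mssparkutils.fs.ls("abfss://targetdata@<storage_account_name>.dfs.core.windows.net/mydelta"):
    print(file_info.name, file_info.size)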


You can read this delta table back from the same path, as below.

spark.read.format("delta").load("abfss://targetdata@<storage_account_name>.dfs.core.windows.net/mydelta").show()
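You can also load it through the Delta Lake API, which confirms the path holds a proper delta table rather than loose parquet files. A small sketch, assuming the delta-core package from the question is available in the session:

from delta.tables import DeltaTable

# Load the delta table by its storage path and show its transaction history
dt = DeltaTable.forPath(spark, "abfss://targetdata@<storage_account_name>.dfs.core.windows.net/mydelta")
dt.history().show()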


Go through this Reference to learn more about Delta tables.

Upvotes: 1
