Morshed

Reputation: 225

Job failed using spark-submit with parameters when the job runs in Azure Databricks: INIT_SCRIPT_FAILURE (CLIENT_ERROR)

I created a simple .NET Core 3.1 project and ran it locally successfully. I wanted to run this application as a job on an Azure Databricks cluster, so I followed the instructions in the Microsoft documentation, but the job failed.

https://learn.microsoft.com/en-us/previous-versions/dotnet/spark/tutorials/databricks-deployment

using Microsoft.Spark.Sql;

// Create (or reuse) the Spark session for this application
SparkSession spark = SparkSession
    .Builder()
    .AppName("HelloWorldSparkApp")
    .GetOrCreate();

// Create a DataFrame with a single row and single column
DataFrame df = spark.Sql("SELECT 'Hello, World!' AS message");

// Show the DataFrame
df.Show();

// Stop the Spark session
spark.Stop();

Cluster configuration


I copied db-init.sh to the Workspace -> Shared folder because there is no DBFS option.

Job configuration

Publish application:
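The publish step followed the tutorial: a self-contained publish for the cluster's Linux workers, zipped for upload. Roughly (the runtime identifier and zip name are from my setup and may differ for yours):

# Publish the app self-contained for the cluster's Linux nodes
dotnet publish -c Release -f netcoreapp3.1 -r linux-x64

# Zip the publish output for upload to Databricks
cd bin/Release/netcoreapp3.1/linux-x64/publish
zip -r HelloSparkCore31.zip .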

Databricks dbfs/spark-dotnet folder content as per MS documentation:

- db-init.sh
- install-worker.sh
- microsoft-spark-3-2_2.12-2.1.1.jar
- Microsoft.Spark.Worker.netcoreapp3.1.linux-x64-2.1.1.tar.gz
- HelloSparkCore31.zip
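The job itself is a spark-submit task whose parameters point DotnetRunner at those files, roughly equivalent to the following (file names are from my upload, and the last argument is the published executable name, assuming I read the tutorial correctly):

spark-submit \
  --class org.apache.spark.deploy.dotnet.DotnetRunner \
  /dbfs/spark-dotnet/microsoft-spark-3-2_2.12-2.1.1.jar \
  /dbfs/spark-dotnet/HelloSparkCore31.zip \
  HelloSparkCore31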

Job creation is okay, but I get an exception when the job starts.

Exception as per job output

Cluster '0901-204609-m9yiukve' was terminated. Reason: INIT_SCRIPT_FAILURE (CLIENT_ERROR). Parameters: instance_id:93d671e9f0884221b689a09b125d2655, databricks_error_message:Cluster scoped init script /Shared/db-init.sh failed: Script exit status is non-zero.


I am still in the learning stage with Databricks. I searched Google a lot but could not resolve this.

Any kind of help or hints would be greatly appreciated.

Upvotes: 0

Views: 84

Answers (1)

JayashankarGS

Reputation: 8160

First, make sure you have altered db-init.sh appropriately: create whatever folders the script needs and use supported versions.

That is, install-worker.sh should be in /dbfs/spark-dotnet as the script expects. However, running init scripts from a DBFS location is deprecated, so use a different root folder for spark-dotnet.
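For reference, the part of db-init.sh that depends on that folder looks roughly like this. This is a sketch based on the tutorial's script, not your exact file, and the SPARK_DOTNET_ROOT variable name is illustrative:

#!/bin/bash
# Worker release to install on every node (use a supported version)
DOTNET_SPARK_RELEASE=https://github.com/dotnet/spark/releases/download/v2.1.1/Microsoft.Spark.Worker.netcoreapp3.1.linux-x64-2.1.1.tar.gz

# Where the worker binaries are installed on each node
DOTNET_SPARK_WORKER_INSTALLATION_PATH=/usr/local/bin

# Root folder holding the uploaded deployment files; this is the path to
# change when moving off the deprecated DBFS location
SPARK_DOTNET_ROOT=/dbfs/spark-dotnet

# Copy the installer locally, make it executable, and run it
cp "$SPARK_DOTNET_ROOT/install-worker.sh" /tmp/install-worker.sh
chmod +x /tmp/install-worker.sh
/tmp/install-worker.sh github "$DOTNET_SPARK_RELEASE" "$DOTNET_SPARK_WORKER_INSTALLATION_PATH"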

Also use the latest DOTNET_SPARK_RELEASE, which is Release .NET for Apache Spark v2.1.1 · dotnet/spark · GitHub.

So, alter DOTNET_SPARK_RELEASE in your script:

DOTNET_SPARK_RELEASE=https://github.com/dotnet/spark/releases/download/v2.1.1/Microsoft.Spark.Worker.netcoreapp3.1.linux-x64-2.1.1.tar.gz

Also check the logs in dbfs:/cluster-logs/ for more detail about the execution of these scripts.
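For example, with the Databricks CLI, assuming the cluster delivers logs to dbfs:/cluster-logs (the cluster id below comes from your error message):

# List the init script logs for the failed cluster
databricks fs ls dbfs:/cluster-logs/0901-204609-m9yiukve/init_scripts/

# Copy them locally to inspect the stdout/stderr of db-init.sh
databricks fs cp --recursive dbfs:/cluster-logs/0901-204609-m9yiukve/init_scripts/ ./init-logs/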

Upvotes: 0
