FelipePerezR

Reputation: 175

Write DataFrame from Databricks to Data Lake

I am manipulating some data with Azure Databricks. The data lives in Azure Data Lake Storage Gen1. I mounted it into DBFS, but now, after transforming the data, I would like to write it back to my data lake.

To mount the data I used the following:

configs = {"dfs.adls.oauth2.access.token.provider.type": "ClientCredential",
       "dfs.adls.oauth2.client.id": "<your-service-client-id>",
       "dfs.adls.oauth2.credential": "<your-service-credentials>",
       "dfs.adls.oauth2.refresh.url": "https://login.microsoftonline.com/<your-directory-id>/oauth2/token"}

dbutils.fs.mount(
    source = "adl://<your-data-lake-store-account-name>.azuredatalakestore.net/<your-directory-name>",
    mount_point = "/mnt/<mount-name>",
    extra_configs = configs)

I want to write back a .csv file. For this task I am using the following line:

dfGPS.write.mode("overwrite").format("com.databricks.spark.csv").option("header", "true").csv("adl://<your-data-lake-store-account-name>.azuredatalakestore.net/<your-directory-name>")

However, I get the following error:

IllegalArgumentException: u'No value for dfs.adls.oauth2.access.token.provider found in conf file.'

Any piece of code that could help me? Or a link that walks me through it?

Thanks.

Upvotes: 2

Views: 10931

Answers (1)

Hauke Mallow

Reputation: 3202

If you mount Azure Data Lake Store, you should use the mountpoint to store your data instead of "adl://...". For details on how to mount Azure Data Lake Store (ADLS) Gen1, see the Azure Databricks documentation. You can verify that the mountpoint works with:

dbutils.fs.ls("/mnt/<newmountpoint>")

So after mounting ADLS Gen1, try:

dfGPS.write.mode("overwrite").format("com.databricks.spark.csv").option("header", "true").csv("mnt/<mount-name>/<your-directory-name>")

This should work if you added the mountpoint properly and the Service Principal also has access rights on the ADLS.
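
On newer Databricks runtimes the CSV source is built into Spark, so the explicit com.databricks.spark.csv format is not needed. A minimal sketch of the same write against the mount point (the path below is a placeholder, as above):

# Placeholder path - use your own mount name and directory
output_path = "/mnt/<mount-name>/<your-directory-name>"

# Write the transformed DataFrame as CSV files with a header row
dfGPS.write.mode("overwrite").option("header", "true").csv(output_path)

# Sanity check: list the files Spark produced in the target folder
dbutils.fs.ls(output_path)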

Spark always writes multiple files into the output directory, because each partition is saved individually. See also the related Stack Overflow question on this.
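
If you need a single CSV file rather than one part file per partition, you can coalesce the DataFrame to one partition before writing. A minimal sketch, assuming the output is small enough to pass through a single task:

# Collapse to one partition so Spark emits a single part file
# (only advisable for small outputs, since all data goes through one task)
dfGPS.coalesce(1).write.mode("overwrite").option("header", "true").csv("/mnt/<mount-name>/<your-directory-name>")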

Upvotes: 3
