Anand

Reputation: 21338

Application logging in Executor/Worker using Azure Databricks python notebooks

I am using Azure Databricks to build and run ETL pipelines. For development, I am using Databricks notebooks (Python). My goal is to view the application logs in the Spark UI for code running on both the driver and the executors.

Initially, I was unable to view executor logs, but as described here https://kb.databricks.com/clusters/set-executor-log-level.html, I am now able to view the application logs emitted from code running on the worker nodes (executors), e.g. inside foreach/foreachPartition.

The above link says that we need to set the log level on all executors. Does that mean we need to set the log level inside every method meant to run on the worker nodes, like below? Having to set the logging level in each method seems redundant and should be avoided.

 import logging

 def doSomething():
    # Runs on the executor, so logging is configured inside the function
    logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(name)s - %(levelname)s - %(message)s')
    ## some operation

 df.foreach(lambda x: doSomething())

To set the log level on all executors, you must set it inside the JVM on each worker.

Is there a better way to do this that avoids setting the log level every time?

Upvotes: 0

Views: 166

Answers (1)

As you mentioned, you want to view application logs for both the driver and the executors in the Spark UI.

To capture driver logs:

import logging
from pyspark import SparkContext

sc = SparkContext.getOrCreate()

# Set the log level for the driver-side 'py4j' logger
log = logging.getLogger('py4j')
log.setLevel(logging.INFO)

def log_executor_operations(iterator):
    # Iterates over one RDD partition (runs on an executor)
    for x in iterator:
        yield x

rdd = sc.parallelize(range(5))
rdd.foreachPartition(log_executor_operations)

In the above, logging.getLogger('py4j') is used to set the log level for the driver process. This captures logs generated within the driver code and shows them in the Spark UI. log_executor_operations is defined to iterate over an RDD partition.
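For completeness, here is a minimal sketch of actually emitting a driver-side message through that logger; the handler setup and the message text are illustrative assumptions (on Databricks the driver output is typically already captured in the driver logs):

import logging

# Assumption: no handler is configured yet in this notebook, so basicConfig attaches one.
logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(name)s - %(levelname)s - %(message)s')

driver_log = logging.getLogger('py4j')
driver_log.info("Driver-side ETL step starting")  # appears in the driver log output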

To capture executor logs:

from pyspark import SparkContext

sc = SparkContext.getOrCreate()
sc.setLogLevel("DEBUG")

def log_executor_operations(iterator):
    # This function runs on the executors, so logging must be configured here
    import logging
    logging.basicConfig(level=logging.DEBUG,
                        format='%(asctime)s - %(name)s - %(levelname)s - %(message)s')
    executor_log = logging.getLogger("ExecutorLog")
    executor_log.setLevel(logging.DEBUG)
    executor_log.info("Executor has started processing.")

    for x in iterator:
        executor_log.debug(f"Processing value: {x}")

rdd = sc.parallelize(range(5))
rdd.foreachPartition(log_executor_operations)

Setting the log level with sc.setLogLevel("DEBUG") controls Spark's own logging, while the logger configured inside log_executor_operations produces the application messages that show up in each executor's logs.
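To avoid repeating the logging setup in every method (the concern raised in the question), one option is to run the configuration once per executor with a small one-off job. This is only a rough sketch: it assumes the partition count is large enough to reach every executor, and whether the configuration survives for later jobs depends on Python worker reuse (spark.python.worker.reuse) and on no new executors joining afterwards.

import logging
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()
sc = spark.sparkContext

LOG_FORMAT = '%(asctime)s - %(name)s - %(levelname)s - %(message)s'

def _configure_executor_logging(_):
    # Runs in an executor's Python worker; configures the root logger once
    logging.basicConfig(level=logging.INFO, format=LOG_FORMAT)

# One-off job: use more partitions than executors so every executor runs it at least once.
sc.parallelize(range(1000), 100).foreach(_configure_executor_logging)

# Later worker-side functions can then call logging.getLogger(...) without reconfiguring.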

Results:

(screenshot of the log output in the Spark UI)

Upvotes: 0
