Aru

Reputation: 184

Does a DataFrame connect to the RDBMS directly from the executors, or does it go through the driver?

For Spark DataFrames, I'm looking for the following under-the-hood explanation, for optimization purposes.

  1. DataFrames are a special type of RDD; internally they are backed by an RDD of Row objects (RDD[Row]). These Row RDDs are spread across the executors.
  2. When we write these Row RDDs (especially while running in YARN-client mode), the rows are transferred from the executors to the driver, and the driver writes into Oracle over its JDBC connection. (Is this true?)
  3. When the same code runs in YARN-cluster mode, the Row RDDs are written to Oracle directly from the executors. This could be faster, but the number of available JDBC connections could limit or slow down the process. (A sketch of the kind of write I mean follows this list.)
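For reference, this is the kind of write I'm talking about (a minimal sketch; the Oracle URL, table name, and credentials are placeholders, not my real setup):

    import java.util.Properties

    val props = new Properties()
    props.setProperty("user", "scott")        // placeholder credentials
    props.setProperty("password", "tiger")
    props.setProperty("driver", "oracle.jdbc.OracleDriver")

    // Hypothetical Oracle URL and target table; df is an existing DataFrame.
    df.write
      .mode("append")
      .jdbc("jdbc:oracle:thin:@//dbhost:1521/ORCLPDB1", "TARGET_TABLE", props)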

I'm not sure this is what happens under the hood; kindly validate this and correct me if I'm wrong. It has a big impact on performance.

Thanks in advance.

Upvotes: 1

Views: 279

Answers (1)

Tagar

Reputation: 14939

Each executor makes its own connection.

// Each of the numPartitions tasks reads its own slice of emp_no over its own JDBC connection.
val df = spark.read.jdbc(url = jdbcUrl,
    table = "employees",
    columnName = "emp_no",      // column used to split the reads into partitions
    lowerBound = 1L,
    upperBound = 100000L,
    numPartitions = 100,
    connectionProperties = connectionProperties)
display(df)                     // Databricks notebook helper; use df.show() elsewhere

In the Spark UI, you will see that numPartitions dictates the number of tasks that are launched. These tasks are distributed across the executors, which can increase the parallelism of reads and writes through the JDBC interface. See the upstream guide for other parameters that can help with performance, such as the fetchsize option.
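The same holds on the write path: each partition is written by the task that holds it, on its executor, not by the driver. A minimal sketch, assuming the jdbcUrl and connectionProperties from above; the target table name, partition count, and batchsize value are illustrative:

    // Each of the 10 partitions is written by its own task, on an executor,
    // over that task's own JDBC connection.
    df.repartition(10)
      .write
      .mode("append")
      .option("batchsize", 10000)   // rows per JDBC batch insert (illustrative value)
      .jdbc(jdbcUrl, "employees_copy", connectionProperties)

Controlling the number of partitions before the write is how you trade off write parallelism against the number of simultaneous connections the database can handle.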

Upvotes: 0
