Reputation: 173
The organisation I am working for is moving from its traditional on-premises execution model to the public cloud. We have to pay for every execution that takes place on the cloud, so to reduce this execution cost we are doing two things:
As a big data engineer, my work mostly depends on Spark SQL, and I am trying to reduce SQL query execution time. What Catalyst does at execution time, I want to do before execution: for example, reading the logical plan, optimizing the logical plan, and generating the physical plan. I also want to add my own custom optimization rules to Catalyst, which would likewise be triggered at build time.
Is there any way to do all this before execution?
Upvotes: 2
Views: 546
Reputation: 1480
You can actually get the execution plan for your query by creating the DataFrame and not performing any action.
Suppose you have a DataFrame `df`; you can access its plans through `df.queryExecution` (e.g. `df.queryExecution.logical` for the parsed logical plan) and traverse them. This might cover your first requirement of avoiding bad executions, if you have some heuristic method to detect them.
As for custom optimizations, you can add your own optimization rules (see https://www.waitingforcode.com/apache-spark-sql/introduction-custom-optimization-apache-spark-sql/read). These are not triggered at build time but at execution time, like all Catalyst optimizations.
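
As a sketch of that approach: Spark exposes `spark.experimental.extraOptimizations` for injecting custom rules into the Catalyst optimizer. The rule below is a hypothetical no-op placeholder that only logs the plan; a real rule would return a rewritten plan.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.catalyst.plans.logical.LogicalPlan
import org.apache.spark.sql.catalyst.rules.Rule

// Placeholder rule (assumed name): logs each plan it sees and returns it
// unchanged. Replace the body with your own rewrite logic.
object LogPlanRule extends Rule[LogicalPlan] {
  override def apply(plan: LogicalPlan): LogicalPlan = {
    logInfo(s"Optimizing plan:\n$plan")
    plan
  }
}

val spark = SparkSession.builder().master("local[*]").getOrCreate()

// Register the rule; Catalyst will now apply it when optimizing queries.
spark.experimental.extraOptimizations = Seq(LogPlanRule)
```

Note the rule still runs as part of query planning when the query is analyzed, not at your application's build time.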
Upvotes: 1