Reputation: 1349
My Spark application processes an input CSV file in several stages. At each stage, the RDDs created are large (MBs to GBs). I have tried creating different numbers of partitions, but the partitions always complete each stage unevenly: some partitions finish a stage very quickly, while the last few always take a long time, keep throwing heartbeat timeout and executor-lost failures, and keep retrying.
I have tried changing the number of partitions to several different values, but the last few partitions never complete. I have not been able to fix this issue despite trying for a long time.
How do I handle this?
Upvotes: 0
Views: 770
Reputation: 330183
Most likely the source of the problem is this part:
aggregateByKey(Vector.empty[(Long, Int)])(_ :+ _, _ ++ _)
Basically, what you're doing is a significantly less efficient version of groupByKey. If the distribution of keys is skewed, then the distribution after the aggregation will be skewed as well, resulting in uneven load on different machines.
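One common way to mitigate that kind of key skew, assuming the per-key work can actually be expressed as an associative reduction rather than collecting every value, is to salt the keys so a hot key is spread over many partitions and only combined at the end. A minimal sketch (the function name, pair types, and salt count are illustrative, not taken from your code):

import org.apache.spark.rdd.RDD
import scala.util.Random

// Spread each key over `numSalts` artificial sub-keys, reduce per salted
// key first (many small, evenly sized tasks), then drop the salt and
// reduce again to get the final per-key result.
def saltedSum(pairs: RDD[(String, Int)], numSalts: Int = 16): RDD[(String, Int)] =
  pairs
    .map { case (k, v) => ((k, Random.nextInt(numSalts)), v) } // attach a random salt
    .reduceByKey(_ + _)                                        // partial sums per salted key
    .map { case ((k, _), partial) => (k, partial) }            // strip the salt
    .reduceByKey(_ + _)                                        // combine partials per original key

If tasks can be recomputed on failure, deriving the salt deterministically (for example from a hash of the value) is safer than Random.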
Moreover, if the data for a single key doesn't fit into main memory, the whole process will fail for the same reason it would with groupByKey.
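If what you need downstream is really an aggregate per key rather than the full list of (Long, Int) pairs, you can keep the per-key state bounded by folding each value into a small fixed-size accumulator instead of appending to a Vector. A self-contained sketch, assuming (purely for illustration) that a count and a sum of the Int component are enough:

import org.apache.spark.sql.SparkSession

object SkewFriendlyAggregation {
  def main(args: Array[String]): Unit = {
    // Local session only so the example runs standalone; not part of the question.
    val spark = SparkSession.builder.master("local[*]").appName("skew-demo").getOrCreate()
    val sc = spark.sparkContext

    // Illustrative data shaped like the question's (key, (Long, Int)) pairs.
    val pairs = sc.parallelize(Seq(
      ("a", (1L, 10)), ("a", (2L, 20)), ("a", (3L, 30)),
      ("b", (4L, 5))
    ))

    // Instead of aggregateByKey(Vector.empty[(Long, Int)])(_ :+ _, _ ++ _),
    // which materializes every value per key (a groupByKey in disguise),
    // fold each value into a (count, sum) accumulator. Per-key state stays
    // O(1), so a hot key cannot exhaust one executor's memory.
    val perKey = pairs.aggregateByKey((0L, 0L))(
      { case ((cnt, sum), (_, v)) => (cnt + 1, sum + v) }, // seqOp: fold in one value
      { case ((c1, s1), (c2, s2)) => (c1 + c2, s1 + s2) }  // combOp: merge partial results
    )

    perKey.collect().foreach(println)
    spark.stop()
  }
}

If you really do need every value per key, plain groupByKey is at least no worse than the Vector-based aggregateByKey, but the single-hot-key memory problem described above remains.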
Upvotes: 3