Reputation: 1
I have a Beam application running with the Spark runner. It ran into what looks like a data-loss issue while saving data to S3 storage. I looked into this page: https://hadoop.apache.org/docs/current/hadoop-aws/tools/hadoop-aws/committers.html
It suggests using an S3A committer for Spark jobs. I followed the suggestion and added the configuration to this Beam job, but I don't know whether it actually used an S3A committer to save the data. So I want to ask: what do I need to configure in this Beam job so that it uses an S3A committer, and how can I prove that the S3A committer was used?
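For reference, the kind of configuration that page describes for Spark looks roughly like this (a sketch using the directory committer; the property names are from the hadoop-aws docs, and whether the Beam Spark runner actually picks them up is exactly what I'm unsure about):

```java
import org.apache.spark.SparkConf;

public class S3aCommitterConf {
    public static SparkConf build() {
        return new SparkConf()
            // Route s3a:// output through the S3A committer factory;
            // the "spark.hadoop." prefix forwards the setting into
            // the Hadoop Configuration.
            .set("spark.hadoop.mapreduce.outputcommitter.factory.scheme.s3a",
                 "org.apache.hadoop.fs.s3a.commit.S3ACommitterFactory")
            // Pick a committer: "directory", "partitioned" or "magic".
            .set("spark.hadoop.fs.s3a.committer.name", "directory");
    }
}
```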
Upvotes: -1
Views: 36
Reputation: 13470
The S3A committers always create a JSON _SUCCESS file containing various statistics, whereas the older committer creates a zero-byte _SUCCESS file. Looking for a _SUCCESS file more than 0 bytes long is enough; parsing it as JSON is even better.
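A minimal sketch of that check using the Hadoop FileSystem API (the bucket and output path are placeholders; it needs hadoop-aws and your S3 credentials on the classpath):

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.URI;
import java.nio.charset.StandardCharsets;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class SuccessFileCheck {
    public static void main(String[] args) throws Exception {
        // Placeholder destination -- point this at your job's output dir.
        Path success = new Path("s3a://my-bucket/output/_SUCCESS");
        FileSystem fs = FileSystem.get(
                URI.create("s3a://my-bucket/"), new Configuration());

        FileStatus status = fs.getFileStatus(success);
        if (status.getLen() == 0) {
            // Zero bytes: the classic FileOutputCommitter wrote it.
            System.out.println("Zero-byte _SUCCESS: S3A committer NOT used.");
        } else {
            // Non-empty: an S3A committer wrote its JSON summary here.
            System.out.println("Non-empty _SUCCESS, contents:");
            try (BufferedReader r = new BufferedReader(
                    new InputStreamReader(fs.open(success),
                                          StandardCharsets.UTF_8))) {
                r.lines().forEach(System.out::println);
            }
        }
    }
}
```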
However, I don't think Beam works with it at all. AFAIK nobody has ever tested it.
Upvotes: 0