nish

Reputation: 7280

spark: case sensitive partitionBy column

I am trying to write out a DataFrame with HiveContext (in ORC format) using a partition key:

df.write().partitionBy("event_type").mode(SaveMode.Overwrite).orc("/path");

However, the column I am partitioning on has values that differ only by case, and this throws an error while writing:

Caused by: java.io.IOException: File already exists: file:/path/_temporary/0/_temporary/attempt_201607262359_0001_m_000000_0/event_type=searchFired/part-r-00000-57167cfc-a9db-41c6-91d8-708c4f7c572c.orc

The event_type column contains both searchFired and SearchFired as values. If I remove one of them from the DataFrame, the write succeeds. How do I solve this?

Upvotes: 3

Views: 1522

Answers (1)

Sim

Reputation: 13538

It is generally not a good idea to rely on case differences in file names: some filesystems (such as the defaults on macOS and Windows) are case-insensitive, so the directories event_type=searchFired and event_type=SearchFired resolve to the same path.

The solution is to combine values that differ only by case into the same partition, for example (using the Scala DSL):

import org.apache.spark.sql.SaveMode
import org.apache.spark.sql.functions.expr

df
  .withColumn("par_event_type", expr("lower(event_type)"))
  .write
  .partitionBy("par_event_type")
  .mode(SaveMode.Overwrite)
  .orc("/path")

This adds an extra column for partitioning. If that causes problems, you can use drop to remove it when you read the data.
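The collision and the fix can be sketched without Spark at all: the writer names each partition directory column=value, so values that differ only by case produce directory names a case-insensitive filesystem treats as the same path, while lowercasing first merges them into one partition. A minimal Java sketch (the partitionDir helper is hypothetical, not a Spark API):

```java
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

public class PartitionCaseDemo {
    // Spark names each partition directory "column=value".
    static String partitionDir(String col, String value) {
        return col + "=" + value;
    }

    public static void main(String[] args) {
        List<String> values = Arrays.asList("searchFired", "SearchFired");

        // Raw values: two directory names that differ only by case,
        // which a case-insensitive filesystem maps to the same path.
        Set<String> raw = values.stream()
                .map(v -> partitionDir("event_type", v))
                .collect(Collectors.toSet());
        System.out.println(raw.size()); // 2

        // Lowercased first (the par_event_type approach): one partition.
        Set<String> normalized = values.stream()
                .map(v -> partitionDir("par_event_type", v.toLowerCase()))
                .collect(Collectors.toSet());
        System.out.println(normalized.size()); // 1
    }
}
```

The same reasoning explains why dropping one of the two values also works: with only one spelling left, there is only one directory name to create.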

Upvotes: 1
