Write a DataFrame to csv file with a custom row/line delimiter/separator

Question

I need to produce a delimited file where each row is separated by a '^' and columns are delimited by '|'.

There doesn't seem to be an option to change the row delimiter for the csv output format.

For example:

(df.coalesce(1).write
    .format("com.databricks.spark.csv")
    .mode("overwrite")
    .option("header", "true")
    .option("sep", "|")
    # no option for setting lineSep to '^'
    .save(destination_path))

MahzadK · Accepted Answer

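One solution is to convert the DataFrame to an RDD and build each line yourself (Scala). Note that mkString joins the fields within a row, so it takes the column delimiter '|'. The '^' is appended to each line, but saveAsTextFile still ends every record with a newline, so those have to be stripped afterwards if they are not wanted:

df.rdd.map(x => x.mkString("|") + "^").saveAsTextFile("OutCSV")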
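
Since the question uses PySpark, here is a minimal Python sketch of building the output by hand so that no newline ends up between rows. It assumes the result is small enough to stream through the driver into one local file. `format_rows` and `write_delimited` are names made up for this example, not Spark APIs, and values are written as-is with no quoting or escaping of '|' or '^'.

```python
from typing import Any, Iterable, Optional, Sequence


def format_rows(
    rows: Iterable[Sequence[Any]],
    col_sep: str = "|",
    row_sep: str = "^",
    header: Optional[Sequence[str]] = None,
) -> str:
    """Join fields with col_sep and rows with row_sep (None becomes an empty field)."""
    lines = []
    if header is not None:
        lines.append(col_sep.join(header))
    for row in rows:
        lines.append(col_sep.join("" if v is None else str(v) for v in row))
    return row_sep.join(lines)


def write_delimited(df, path: str, col_sep: str = "|", row_sep: str = "^") -> None:
    """Hypothetical helper: write a Spark DataFrame to a single local file.

    Assumption: the data fits in driver memory, because the whole string is
    built before writing. df.toLocalIterator() yields Row objects, which
    iterate over their field values like tuples.
    """
    text = format_rows(df.toLocalIterator(), col_sep, row_sep, header=df.columns)
    with open(path, "w", encoding="utf-8") as out:
        out.write(text)
```

Usage would be something like write_delimited(df, "/tmp/out.txt"). For larger data this approach will not scale. Spark 3.0 and later also document a single-character `lineSep` option for the CSV writer, so `.option("lineSep", "^")` may be all that is needed there; check it against your Spark version.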
