Difference between File write using spark and scala and the advantages?

Question

DF().write
  .format("com.databricks.spark.csv")
  .save("filepath/selectedDataset.csv")

vs

scala.tools.nsc.io.File("/Users/saravana-6868/Desktop/hello.txt").writeAll("String"))

In the above code, I used to write a file using both dataframes and scala. What is the difference in the above code?

Chandan Ray · Accepted Answer

The first piece of code is specific to SPARK API of writing the dataframe to a file in csv format. You can write to hdfs or local file system using this. even you can repartition and parallellize your write. The second piece of code is SCALA API which can only write in the local file system. You cannot parallelize it. The first code levearage the whole cluster to do its work but not the second piece of code.

Difference between File write using spark and scala and the advantages?

Answers (1)

Related Questions