value option is not a member of org.apache.spark.sql.DataFrame

Question

I'm trying to create a data frame in scala as below:

var olympics =spark.read.csv("/FileStore/tables/Soccer_Data_Set_c46d1.txt").option("inferSchema","true").option("header","true").option("delimiter",",")

When I submit the code it throws me value option is not a member of org.apache.spark.sql.DataFrame error.

However when i modify the code as below:

var olympics = spark.read.option("inferSchema","true").option("header","true").option("delimiter",",").csv("/FileStore/tables/Soccer_Data_Set_-c46d1.txt")

olympics dataframe is successfully created.

Can someone please help me understand the difference between these two code snippets?

Rayan Ral · Accepted Answer

After you've called csv method, you already have a DataFrame, and data is already read "into" spark, so it doesn't make sense to set options there.

In the second example, you're calling read to "say" that you want spark to read a file, setting properties of such read, and then actually reading the file.

value option is not a member of org.apache.spark.sql.DataFrame

Answers (2)

Related Questions