Reputation: 662
From the Spark example (https://spark.apache.org/examples.html), the code looks like:
val file = spark.textFile("hdfs://...")
val counts = file.flatMap(line => line.split(" "))
                 .map(word => (word, 1))
                 .reduceByKey(_ + _)
And it works when compiled. However, if I try this exact code in the Spark REPL:
scala> val lines = "abc def"
lines: String = abc def
scala> val words = lines.flatMap(_.split(" "))
<console>:12: error: value split is not a member of Char
       val words = lines.flatMap(_.split(" "))
                                   ^
What gives??
Thanks, Matt
Upvotes: 4
Views: 10872
Reputation: 67075
lines is just a String, so flatMap is being run against a sequence of characters. You need to use an RDD:
val rddYouCanUse = sc.parallelize(List("abc def"))
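Putting it together, here is a minimal sketch of the word count from the question run against that RDD (this assumes the spark-shell, where sc is the SparkContext the shell provides):
val rdd = sc.parallelize(List("abc def"))
val counts = rdd.flatMap(_.split(" "))   // RDD[String]: one element per word
                .map(word => (word, 1))  // RDD[(String, Int)]: pair each word with 1
                .reduceByKey(_ + _)      // sum the 1s per distinct word
counts.collect()                         // Array((abc,1), (def,1)), order may vary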
Upvotes: 2
Reputation: 43
In the Spark example, file is an RDD[String] (conceptually, something like an Iterator[String] over lines). In your code, lines is just a String. No iterator. That means when you flatMap the String, you're treating the String itself as the collection, so each element is a Char. And Char doesn't have split as a method (that wouldn't make sense for a single character).
If you break it out a little:
val words = lines.flatMap(x => x.split(" "))
                          ^ this x is a Char
You can just split on the string itself:
val words = lines.split(" ")
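To see why the elements are Chars, here is a quick check you can run in any plain Scala REPL (no Spark needed):
// A String is implicitly a Seq[Char], so its elements are single characters
"abc def".head            // Char = a
"abc def".map(_.isUpper)  // IndexedSeq[Boolean] = Vector(false, false, ...)
// Char has no split method, which is why _.split(" ") fails inside flatMap.
// Splitting the String directly gives an Array[String]:
"abc def".split(" ")      // Array(abc, def)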
Upvotes: 2