ImbaBalboa

Reputation: 867

Why does reading stream from Kafka fail with "Unable to find encoder for type stored in a Dataset"?

I am trying to use Spark Structured Streaming with Kafka.

object StructuredStreaming {

  def main(args: Array[String]) {
    if (args.length < 2) {
      System.err.println("Usage: StructuredStreaming <hostname> <port>")
      System.exit(1)
    }

    val host = args(0)
    val port = args(1).toInt

    val spark = SparkSession
      .builder
      .appName("StructuredStreaming")
      .config("spark.master", "local")
      .getOrCreate()

    import spark.implicits._

    // Subscribe to 1 topic
    val lines = spark
      .readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "localhost:9093")
      .option("subscribe", "sparkss")
      .load()
    lines.selectExpr("CAST(key AS STRING)", "CAST(value AS STRING)")
      .as[(String, String)]
  }
}

I took the code from the Spark documentation, and I get this build error:

Unable to find encoder for type stored in a Dataset. Primitive types (Int, String, etc) and Product types (case classes) are supported by importing spark.implicits._ Support for serializing other types will be added in future releases. .as[(String, String)]

I read in other SO posts that this is caused by a missing import spark.implicits._, but adding it changes nothing for me.

UPDATE:

<properties>
    <project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
    <slf4j.version>1.7.12</slf4j.version>
    <spark.version>2.1.0</spark.version>
    <scala.version>2.10.4</scala.version>
    <scala.binary.version>2.10</scala.binary.version>
</properties>

<dependencies>

    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-core_2.10</artifactId>
        <version>2.1.0</version>
    </dependency>

    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-sql_2.10</artifactId>
        <version>2.1.0</version>
    </dependency>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-sql-kafka-0-10_2.10</artifactId>
        <version>2.1.0</version>
    </dependency>
</dependencies>

Upvotes: 2

Views: 413

Answers (1)

ImbaBalboa

Well, I tried with Scala 2.11.8

<scala.version>2.11.8</scala.version>
<scala.binary.version>2.11</scala.binary.version>

<dependencies>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-core_2.11</artifactId>
        <version>2.1.0</version>
    </dependency>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-sql_2.11</artifactId>
        <version>2.1.0</version>
    </dependency>
    <dependency>
        <groupId>org.apache.spark</groupId>
        <artifactId>spark-sql-kafka-0-10_2.11</artifactId>
        <version>2.1.0</version>
    </dependency>

</dependencies>

With the Scala binary version and the matching _2.11 Spark artifacts aligned, it eventually worked.

Warning: you need to reimport/restart the project in IntelliJ after changing the Scala version; otherwise the old errors stick around.
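For reference, here is a minimal end-to-end sketch of the same pipeline once the versions line up. The broker address and topic name are taken from the question; the console sink and the object name are assumptions for local testing, not part of the original code:

```scala
import org.apache.spark.sql.SparkSession

object StructuredStreamingDemo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession
      .builder
      .appName("StructuredStreamingDemo")
      .config("spark.master", "local[*]")
      .getOrCreate()

    // Needed for the (String, String) tuple encoder used by .as[...]
    import spark.implicits._

    val lines = spark
      .readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "localhost:9093")
      .option("subscribe", "sparkss")
      .load()

    // Kafka delivers key/value as binary; cast to strings before typing the Dataset
    val kv = lines
      .selectExpr("CAST(key AS STRING)", "CAST(value AS STRING)")
      .as[(String, String)]

    // Print each micro-batch to stdout; blocks until the query is stopped
    kv.writeStream
      .format("console")
      .start()
      .awaitTermination()
  }
}
```

Note that the original snippet discarded the result of .as[(String, String)]; a streaming Dataset only does work once a sink is attached via writeStream and start().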

Upvotes: 0
