Reputation: 842
I would like to have Structured Streaming read from a JSON file, process the data, and write it to both Kafka and Parquet sinks. I see the sample code below for this:
datasetOfString.writeStream.foreach(new ForeachWriter[String] {
  def open(partitionId: Long, version: Long): Boolean = {
    // open connection
    true
  }
  def process(record: String): Unit = {
    // write string to connection
  }
  def close(errorOrNull: Throwable): Unit = {
    // close the connection
  }
}).start()
But how can I pass multiple writers here? Would it be like the following?
datasetOfString.writeStream.foreach(kafkaWriter).start()
datasetOfString.writeStream.foreach(parquetWriter).start()
If I do it like this, then what would be the purpose of using a ForeachWriter? Is it just for more control while writing?
Upvotes: 1
Views: 2145
Reputation: 668
The foreach() operation is an action.
It does not return any value.
It executes the input function on each element of an RDD.
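For instance, a minimal sketch of the RDD foreach action itself (not the streaming sink); the session setup here is only for illustration:

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder.appName("ForeachAction").master("local[*]").getOrCreate()

// foreach is an action: it runs a side-effecting function on every element
// of the RDD and returns Unit, i.e. nothing comes back to the driver.
spark.sparkContext.parallelize(Seq(1, 2, 3)).foreach(x => println(x))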
Quoting the official Spark Structured Streaming documentation:
Streaming queries are currently bound to a single sink, so multiplexing the write to multiple sinks within the same streaming query isn't possible.
With your current implementation you would effectively process the data twice, once per query, and I would not recommend that. Instead, you can process each micro-batch once and then use foreach (or a custom sink) that writes to both the Kafka topic and the Parquet output.
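On Spark 2.4+ one way to do this is foreachBatch, which hands you each micro-batch as a regular DataFrame so a single streaming query can write to several sinks. A minimal sketch, assuming a Spark 2.4+ setup with the spark-sql-kafka-0-10 package on the classpath; the schema, file paths, broker address, and topic name below are placeholders you would replace with your own:

import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, struct, to_json}
import org.apache.spark.sql.types.{StringType, StructType}

object MultiSinkStream {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder.appName("MultiSinkStream").getOrCreate()

    // Hypothetical schema and paths; adjust to your data.
    val schema = new StructType()
      .add("id", StringType)
      .add("payload", StringType)

    val input = spark.readStream
      .schema(schema)
      .json("/path/to/json/input")

    // One streaming query; each micro-batch is written to both sinks.
    val writeToBothSinks: (DataFrame, Long) => Unit = (batchDF, batchId) => {
      batchDF.persist() // avoid recomputing the micro-batch once per sink

      // The Kafka batch writer expects a string/binary "value" column.
      batchDF.select(to_json(struct(col("*"))).alias("value"))
        .write
        .format("kafka")
        .option("kafka.bootstrap.servers", "broker:9092")
        .option("topic", "my-topic")
        .save()

      // Parquet sink for the same micro-batch.
      batchDF.write.mode("append").parquet("/path/to/parquet/output")

      batchDF.unpersist()
    }

    val query = input.writeStream
      .foreachBatch(writeToBothSinks)
      .option("checkpointLocation", "/path/to/checkpoint")
      .start()

    query.awaitTermination()
  }
}

Caching the batch avoids recomputing it once per sink, and the checkpoint location lets the query recover across restarts.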
Upvotes: 1