DGALLENE

Reputation: 31

Is it possible to work with Spark Structured Streaming without HDFS?

I've been working with HDFS and Kafka for some time, and I've noticed that Kafka is more reliable than HDFS. So now, working with Spark Structured Streaming, I'm surprised that checkpointing only works with HDFS. Checkpointing with Kafka would be faster and more reliable. So is it possible to work with Spark Structured Streaming without HDFS? It seems strange that we have to use HDFS only for streaming data that lives in Kafka. Or is it possible to tell Spark to forget checkpointing and manage it in the program myself?

Spark 2.4.7

Thank you

Upvotes: 3

Views: 400

Answers (1)

Michael Heil

Reputation: 18515

You are not restricted to using an HDFS path as a checkpoint location.

According to the section Recovering from Failures with Checkpointing in the Spark Structured Streaming Guide, the path has to be "an HDFS compatible file system". Therefore, other file systems will also work. However, it is mandatory that all executors have access to that file system. For example, choosing the local file system on the edge node of your cluster might work in local mode, but in cluster mode it can cause issues.
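For illustration, a minimal sketch of setting the checkpoint location on a Kafka streaming query (the bootstrap server, topic name, and paths below are placeholders, not from the question; any HDFS-compatible URI reachable by all executors would work in place of the local path):

```python
# Hypothetical sketch: checkpointing to a non-HDFS location.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("checkpoint-demo").getOrCreate()

# Read from Kafka (example server and topic).
df = (spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "localhost:9092")
      .option("subscribe", "my-topic")
      .load())

query = (df.writeStream
         .format("console")
         # Any HDFS-compatible file system is accepted here: a local path
         # in local mode, or e.g. an s3a:// URI in cluster mode, as long as
         # every executor can access it.
         .option("checkpointLocation", "/tmp/spark-checkpoints/my-query")
         .start())
```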

Also, it is not possible to have Kafka itself manage the offset positions for Spark Structured Streaming. I have explained this in more depth in my answer to How to manually set group.id and commit kafka offsets in spark structured streaming?.

Upvotes: 1
