Reputation: 1518
As I understand, Spark can analyze streams with Spark Streaming. And Kafka can receive data from multiple sources. What I don't understand is, if i have a Kafka cluster receiving data from multiple sources, will the data be send to a database with Spark Streaming running? Or is Spark Streaming running on a application server?
Upvotes: 0
Views: 150
Reputation: 62310
If you use Spark Streaming, you need to set up a Spark cluster and you will submit you Spark Streaming job to the cluster. Thus, you will have to 2 cluster: Kafka + Spark (or actually 3, as you also need a Zookeeper cluster for Kafka).
Upvotes: 1