Viraj

Reputation: 797

KafkaIO checkpoint persistence with Google Dataflow Runner

I am trying to understand how offsets and group management work when the KafkaIO reader runs on the Google Dataflow runner. More specifically, I am trying to understand how offset management works.

Any code/document reference pointing in the right direction is appreciated.

Upvotes: 0

Views: 473

Answers (1)

Kenn Knowles

Reputation: 6023

The KafkaIO reader is entirely part of Apache Beam. Google Cloud Dataflow does not treat this source differently than any other Beam source.

You can find its code at https://github.com/apache/beam/tree/master/sdks/java/io/kafka/src/main/java/org/apache/beam/sdk/io/kafka (in various files). I don't know of any reference documentation other than the Javadoc.
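To make the reading of that package a bit easier, here is a rough mental model of the checkpoint idea, written as a toy in plain Java. It is only a sketch under my own assumptions: every type in it (`CheckpointMark`, `FakeBroker`, `Reader`) is hypothetical and is not Beam's or Kafka's actual code. The idea it illustrates is that the reader tracks the next offset per partition, the runner persists a snapshot of those offsets as a checkpoint, and a resumed reader starts from that snapshot. Committing offsets back to the Kafka consumer group is modeled as a separate, optional step that happens when a checkpoint is finalized. In the linked package, `KafkaCheckpointMark` and the `commitOffsetsInFinalize()` option on `KafkaIO.Read` are the places I would start looking to confirm the real behavior.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

// Toy model only: none of these types are Beam or Kafka classes.
final class OffsetCheckpointSketch {

    /** Immutable snapshot of the next offset to read, per partition. */
    static final class CheckpointMark {
        private final Map<Integer, Long> nextOffsets;

        CheckpointMark(Map<Integer, Long> nextOffsets) {
            this.nextOffsets = Collections.unmodifiableMap(new HashMap<>(nextOffsets));
        }

        Map<Integer, Long> nextOffsets() {
            return nextOffsets;
        }
    }

    /** Hypothetical stand-in for the offsets a broker stores for a consumer group. */
    static final class FakeBroker {
        final Map<Integer, Long> committed = new HashMap<>();
    }

    /** Hypothetical reader that tracks its own position independently of the broker. */
    static final class Reader {
        private final Map<Integer, Long> position = new HashMap<>();
        private final FakeBroker broker;
        private final boolean commitOnFinalize;

        /** Pass a previously persisted mark to resume, or null to start from offset 0. */
        Reader(CheckpointMark restored, FakeBroker broker, boolean commitOnFinalize) {
            if (restored != null) {
                position.putAll(restored.nextOffsets());
            }
            this.broker = broker;
            this.commitOnFinalize = commitOnFinalize;
        }

        /** Pretend to read `count` records from a partition, advancing the position. */
        void consume(int partition, int count) {
            position.merge(partition, (long) count, Long::sum);
        }

        long positionOf(int partition) {
            return position.getOrDefault(partition, 0L);
        }

        /** Snapshot the current positions; in this model the runner persists the result. */
        CheckpointMark checkpoint() {
            return new CheckpointMark(position);
        }

        /** Called once the snapshot is durably stored; optionally commit to the broker. */
        void finalizeCheckpoint(CheckpointMark mark) {
            if (commitOnFinalize) {
                broker.committed.putAll(mark.nextOffsets());
            }
        }
    }

    public static void main(String[] args) {
        FakeBroker broker = new FakeBroker();
        Reader reader = new Reader(null, broker, true);
        reader.consume(0, 5);
        CheckpointMark mark = reader.checkpoint();
        reader.finalizeCheckpoint(mark);

        // A restarted reader resumes from the persisted mark, not from the broker.
        Reader resumed = new Reader(mark, broker, true);
        System.out.println("resumed position: " + resumed.positionOf(0));
        System.out.println("broker committed: " + broker.committed);
    }
}
```

The point of the separation in this sketch is that the resume position comes from the checkpoint the runner stored, while the offsets visible on the broker for the consumer group only move if committing on finalize is switched on.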

Upvotes: 0
