Shadow

Reputation: 123

What is the best way to do metadata join in Flink datastream?

We have a Kafka stream of events that we want to enrich using some metadata that resides inside a MySQL DB.

The metadata changes every few hours. Essentially we want to periodically read the DB and keep enriching the events with this new metadata.

One way could be to use broadcast state with a periodic source that reads the DB every few minutes/hours, broadcast that stream, and use it for the join. The problem is that the first read of the broadcast stream may arrive later than some of the messages already being read from the Kafka stream.

Is there any better way?

Upvotes: 0

Views: 696

Answers (1)

David Anderson

Reputation: 43697

You could use Flink SQL for this. Depending on the exact requirements, you might either do time-versioned joins against a CDC stream from the MySQL DB, or do lookup joins against MySQL (perhaps with the optional cache enabled).
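For the lookup-join variant, a sketch in Flink SQL might look like the following. The table and column names are made up, and the cache options shown are the JDBC connector's optional lookup cache settings; check the connector documentation for the exact option names in your Flink version:

```sql
-- Metadata table backed by MySQL via the JDBC connector (names illustrative)
CREATE TABLE metadata (
  id BIGINT,
  category STRING,
  PRIMARY KEY (id) NOT ENFORCED
) WITH (
  'connector' = 'jdbc',
  'url' = 'jdbc:mysql://localhost:3306/mydb',
  'table-name' = 'metadata',
  'lookup.cache.max-rows' = '5000',  -- optional lookup cache
  'lookup.cache.ttl' = '1h'          -- re-read rows roughly hourly
);

-- Lookup join: each event probes MySQL (or the cache) at processing time
SELECT e.*, m.category
FROM events AS e
JOIN metadata FOR SYSTEM_TIME AS OF e.proc_time AS m
  ON e.metadata_id = m.id;
```

With the cache enabled, a cached row is reused until its TTL expires, which matches the "metadata changes every few hours" requirement without hitting MySQL on every event.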

See also https://github.com/ververica/flink-cdc-connectors.

Update:

If you want to use the DataStream API, but are concerned that some of the Kafka messages might be processed before the corresponding data from the broadcast stream is available, you can:

  • in the open() method of your enrichment function, do an initial query against MySQL to preload the metadata
  • if the broadcast data is still unavailable when it's time to do the join, fall back to the data fetched during open(), or to some default value hardwired into the code
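The fallback logic from the two bullets above can be sketched like this. This is a plain-Python simulation of the pattern, not the actual Flink KeyedBroadcastProcessFunction API; the class and method names mirror the Flink ones but are hypothetical here, and the "UNKNOWN" default is an example of a hardwired fallback:

```python
class EnrichmentFunction:
    """Mimics the enrichment logic of a Flink broadcast-join function."""

    def __init__(self, db_query):
        # db_query is a callable standing in for a MySQL query (assumption)
        self._db_query = db_query
        self.preloaded = {}        # snapshot fetched in open()
        self.broadcast_state = {}  # filled as broadcast elements arrive

    def open(self):
        # Initial query against MySQL to preload the metadata
        self.preloaded = dict(self._db_query())

    def process_broadcast_element(self, key, value):
        # Periodic broadcast updates take precedence once they arrive
        self.broadcast_state[key] = value

    def process_element(self, event):
        key = event["key"]
        # Prefer broadcast state; fall back to the preloaded snapshot,
        # then to a hardwired default.
        meta = self.broadcast_state.get(
            key, self.preloaded.get(key, "UNKNOWN"))
        return {**event, "meta": meta}
```

Events arriving before the first broadcast read are enriched from the preloaded snapshot; once broadcast updates arrive, they win.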

Alternatively, you could use the state processor API to bootstrap values for the broadcast state.

Upvotes: 0
