Yauza

Reputation: 180

Dataflow: controlling high fan-out between steps

I have 3 steps in a Dataflow pipeline (sketched below):

  1. Reads from Pub/Sub, saves the message to a table, and splits it into multiple events (emitted via the context output).
  2. For each split event, queries the database and decorates the event with additional data.
  3. Publishes to another Pub/Sub topic for further processing.
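
For reference, the pipeline shape is roughly this (a minimal Beam Java sketch; the topic names, the event type, and the split/enrich bodies are placeholders, not the real code):

```java
import org.apache.beam.sdk.Pipeline;
import org.apache.beam.sdk.io.gcp.pubsub.PubsubIO;
import org.apache.beam.sdk.options.PipelineOptionsFactory;
import org.apache.beam.sdk.transforms.DoFn;
import org.apache.beam.sdk.transforms.ParDo;

public class FanoutPipeline {
  public static void main(String[] args) {
    Pipeline p = Pipeline.create(PipelineOptionsFactory.fromArgs(args).create());

    p.apply(PubsubIO.readStrings().fromTopic("projects/my-project/topics/input"))
        // Step 1: persist the raw message, then fan out into many events.
        .apply("SaveAndSplit", ParDo.of(new DoFn<String, String>() {
          @ProcessElement
          public void process(@Element String msg, OutputReceiver<String> out) {
            // saveToTable(msg); // placeholder for the table write
            for (String event : msg.split(",")) {
              out.output(event); // 10K-20K outputs per input in practice
            }
          }
        }))
        // Step 2: enrich each event with a DB lookup (where the pool runs dry).
        .apply("Enrich", ParDo.of(new DoFn<String, String>() {
          @ProcessElement
          public void process(@Element String event, OutputReceiver<String> out) {
            out.output(event + "|decorated"); // placeholder for the DB query
          }
        }))
        // Step 3: publish for further processing.
        .apply(PubsubIO.writeStrings().to("projects/my-project/topics/output"));

    p.run();
  }
}
```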

PROBLEM:
After step 1, it splits into 10K to 20K events.

Now in step 2 it runs out of database connections. (I have a static HikariCP connection pool.)

It works absolutely fine with less data. I am using an n1-standard-32 machine.

What should I do to limit the input to the next step, so that parallelism is restricted or events are throttled before they reach it?

Upvotes: 0

Views: 1303

Answers (1)

Rui Wang

Reputation: 839

I think the basic idea is to reduce the parallelism when executing step 2 (with massive parallelism you would need 20k connections for 20k events, because all 20k events are processed in parallel).

Ideas include:

  1. A stateful ParDo's execution is serialized per key per window, which means only one connection is needed per key, because only one element is processed at a given time for a given key and window (see the first sketch after this list).

  2. One connection per bundle. You can initialize a connection in @StartBundle and have all elements within the same bundle reuse that connection (if my understanding is correct, execution within a bundle is serialized; see the second sketch after this list).
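
A sketch of idea 1, assuming a String event type: assign each event to one of N synthetic keys, then enrich inside a stateful DoFn. Declaring any state makes the ParDo stateful, so Beam serializes execution per key per window, and at most N elements (hence at most N connections) are in flight at once. The shard count of 16 and the enrichment body are placeholders:

```java
import org.apache.beam.sdk.state.StateSpec;
import org.apache.beam.sdk.state.StateSpecs;
import org.apache.beam.sdk.state.ValueState;
import org.apache.beam.sdk.transforms.DoFn;
import org.apache.beam.sdk.transforms.ParDo;
import org.apache.beam.sdk.transforms.WithKeys;
import org.apache.beam.sdk.values.KV;
import org.apache.beam.sdk.values.PCollection;
import org.apache.beam.sdk.values.TypeDescriptors;

class SerializedEnrich {
  static PCollection<String> expand(PCollection<String> events) {
    final int maxParallelism = 16; // cap on concurrent elements/connections
    return events
        .apply("AssignShard", WithKeys.of(
                (String e) -> Math.floorMod(e.hashCode(), maxParallelism))
            .withKeyType(TypeDescriptors.integers()))
        .apply("Enrich", ParDo.of(new DoFn<KV<Integer, String>, String>() {
          // Declaring state is what makes this DoFn stateful, forcing
          // serialized execution per key per window.
          @StateId("marker")
          private final StateSpec<ValueState<Boolean>> marker = StateSpecs.value();

          @ProcessElement
          public void process(@Element KV<Integer, String> kv,
                              @StateId("marker") ValueState<Boolean> marker,
                              OutputReceiver<String> out) {
            // Only one element per key is in flight here, so a single
            // pooled connection per shard suffices.
            out.output(kv.getValue() + "|decorated"); // placeholder DB lookup
          }
        }));
  }
}
```

Picking the shard count trades throughput against connection usage: it should be no larger than the number of connections your database can sustain.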
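And a sketch of idea 2: borrow one connection from a per-worker Hikari pool in @StartBundle, reuse it for every element of the bundle, and return it in @FinishBundle. The JDBC URL, pool size, and query are placeholders:

```java
import java.sql.Connection;
import java.sql.SQLException;
import com.zaxxer.hikari.HikariDataSource;
import org.apache.beam.sdk.transforms.DoFn;

class EnrichFn extends DoFn<String, String> {
  // One pool per worker JVM; static fields are not serialized with the DoFn.
  private static final HikariDataSource POOL = createPool();
  private transient Connection conn;

  private static HikariDataSource createPool() {
    HikariDataSource ds = new HikariDataSource();
    ds.setJdbcUrl("jdbc:postgresql://db-host/mydb"); // placeholder URL
    ds.setMaximumPoolSize(8); // small fixed cap per worker
    return ds;
  }

  @StartBundle
  public void startBundle() throws SQLException {
    // One connection serves the whole bundle; elements of a bundle are
    // processed sequentially, so sharing it is safe.
    conn = POOL.getConnection();
  }

  @ProcessElement
  public void process(@Element String event, OutputReceiver<String> out) {
    // Reuse the bundle's connection for the lookup (query elided).
    out.output(event + "|decorated");
  }

  @FinishBundle
  public void finishBundle() throws SQLException {
    if (conn != null) {
      conn.close(); // returns the connection to the Hikari pool
      conn = null;
    }
  }
}
```

With this, concurrent connection usage is bounded by the number of bundles executing at once per worker rather than by the number of elements.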

Upvotes: 1
