Reputation: 552
I have a Dataflow pipeline running that fetches a configuration of active tenants (stored in GCS) and feeds it into an ActiveTenantFilter
as a sideInput. The configuration is rarely updated, hence why I decided to re-deploy the pipeline, using the --update
flag, whenever it is updated.
However, when using the update flag, the file is not fetched again, i.e., the state is maintained. Is it possible to enforce that this PCollectionView
is updated whenever the pipeline is re-deployed?
Upvotes: 0
Views: 264
Reputation: 5104
You are correct, when you --update
a pipeline it will process new data but will not re-load old data. It sounds like what you want is slowly updating side inputs which unfortunately has not been implemented yet. You could instead try draining and re-starting your pipeline.
Upvotes: 1