Reputation: 3245
What is a good pattern for aggregating the results from Kubeflow Pipleine kfp.ParallelFor?
Upvotes: 6
Views: 1285
Reputation: 21
Not exactly what you asked for, but our workaround was to write the results of the parallelfor tasks into S3 and simply collect them afterwards in a postprocessing task.
with dsl.ParallelFor(preprocessing_task.output) as plant_item:
predict_plant='{}'.format(plant_item)
forecasting_task = forecasting_op(predict_plant, ....).after(preprocessing_task)
postprocessing_task = postprocessing_op(...).after(forecasting_task)
Upvotes: 2
Reputation: 141
At the moment this might not be supported:
Support inputs with multiple arguments #1933
Upvotes: 0