Reputation: 19
I am trying to build data ingestion pipelines (ETL) using Google Cloud Platform I have python scripts that downloads public data, uploads it to cloud storage, and performs transformation on this data and uploads it to BigQuery These scripts have to be run on a schedule ( hourly & daily) We are considering two options to achieve this goal:
Option 1:
Option 2:
Which of these two options is better overall? Is there a comparison of cost, reliability & efficiency between the two methods?
Have tried both methods to build data ingestion pipelines and they work as expected
Upvotes: 0
Views: 169
Reputation: 75940
I have a better proposal:
Here, some explanation on that proposal:
Upvotes: 1