Reputation: 931
After some research and testing, we have decided to start using Google Cloud Composer. Since our current DAGs and tasks are relatively small and don't require the server to run continuously, I am looking at how to manage costs.
Two questions:
Using preemptible VMs seems logical. This saves costs considerably, and I'm thinking of going for 3x n1-standard-4. I expect each task to be quite short, so I don't think this will have a significant impact on our workloads. Is it possible to use preemptible VMs with Composer?
Help, anyone?
Upvotes: 6
Views: 2985
Reputation: 2331
This is an interesting question.
One roadblock you may encounter is the nature of Airflow itself. Generally, Airflow is not intended for ephemeral use. Instead, I'd suspect that the vast majority of Airflow use, Cloud Composer or otherwise, is persistent. Ephemerality brings cost benefits, but it also brings risks given Airflow's architecture. For example, what happens if the scheduler that is supposed to restart your Airflow resources fails?
To answer your questions:
gcloud) to connect to your project
In the long term, I think Composer will better support ephemeral use of worker resources. In the short term, another option is to run a lightweight Airflow environment on a small(ish) GCE VM and then suspend/resume that VM when you need to use Airflow. You don't get Composer that way, but you do benefit from the team's work improving and expanding GCP support in core Airflow.
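For the suspend/resume option, here is a rough sketch of how you might script it in Python. It only wraps the gcloud CLI, so it assumes gcloud is installed and already authenticated against your project. The instance name and zone in the usage note are hypothetical placeholders, and the helper names are my own, not anything from Composer or Airflow.

```python
import subprocess
from typing import List


def vm_power_command(action: str, instance: str, zone: str) -> List[str]:
    """Build the gcloud argv that suspends or resumes one Compute Engine VM."""
    if action not in ("suspend", "resume"):
        raise ValueError(f"unsupported action: {action!r}")
    return ["gcloud", "compute", "instances", action, instance, "--zone", zone]


def run_vm_power_command(
    action: str, instance: str, zone: str, dry_run: bool = True
) -> List[str]:
    """Print the command (dry run) or execute it; return the argv either way."""
    cmd = vm_power_command(action, instance, zone)
    if dry_run:
        # Show what would be executed without touching any VM.
        print(" ".join(cmd))
    else:
        # check=True raises CalledProcessError if gcloud exits non-zero.
        subprocess.run(cmd, check=True)
    return cmd
```

With a hypothetical VM called airflow-vm in us-central1-a, you would call run_vm_power_command("resume", "airflow-vm", "us-central1-a", dry_run=False) before your DAGs are due and the same call with "suspend" afterwards. Keep in mind that whatever triggers these calls has to live outside the VM, which is the same "what restarts it?" risk mentioned above.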
Upvotes: 8