Reputation: 331
I have a Presto cluster on EMR, with an auto scaling policy based on AWS CloudWatch metrics. The issue is that when the cluster scales down, it doesn't wait for queries running on the removed nodes to complete. It just kills all of those queries and scales down. There is a property that can be set in the cluster configuration to shut nodes down gracefully, but it has a limitation: it only works with PrestoDB, not with Trino or PrestoSQL.
https://docs.aws.amazon.com/emr/latest/ReleaseGuide/presto-graceful-autoscale.html
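For context, a minimal sketch of the kind of configuration that page describes, written as Python that prints the EMR configuration JSON. The `presto-config` classification and the `graceful-shutdown-timeout` property are the names I understand the linked page to use. The helper name and the `30m` value are only illustrative, so check the page for the accepted value format.

```python
import json


def presto_graceful_config(timeout: str) -> list:
    """Build an EMR configuration list that sets a graceful shutdown timeout.

    Assumption: PrestoDB on EMR reads `graceful-shutdown-timeout` from the
    `presto-config` classification, as described on the linked AWS page.
    """
    return [
        {
            "Classification": "presto-config",
            "Properties": {"graceful-shutdown-timeout": timeout},
        }
    ]


if __name__ == "__main__":
    # "30m" is a placeholder value, not a recommendation.
    print(json.dumps(presto_graceful_config("30m"), indent=2))
```

The printed JSON is what would go into the cluster's software configuration at creation time.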
I have also explored the cooldown period in the autoscaling policies, but none of the settings seems to work for our cluster.
Is this a known issue for PrestoSQL/Trino?
Upvotes: 1
Views: 997
Reputation: 49944
AWS published a GitHub repo, Amazon EMR - Trino Autoscale, in March 2023, which I think will help.
The README says:
This project provides a custom auto-autoscaling for Amazon EMR on EC2 clusters running with Trino. The package support out of the box Instance Groups and Instance Fleets clusters with On Demand / SPOT instances.
...
In order to use this utility, it is recommended to enable Trino fault-tolerant execution to avoid query failures during resize operations.
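As a rough illustration of that last recommendation, here is a minimal sketch that builds EMR configuration JSON for Trino fault-tolerant execution. `retry-policy` and `exchange.base-directories` are Trino properties. The `trino-config` and `trino-exchange-manager` classification names, the helper name, and the bucket path are assumptions on my part, so verify them against the EMR release guide for your version.

```python
import json


def trino_fte_config(exchange_dir: str, retry_policy: str = "TASK") -> list:
    """Build an EMR configuration list enabling Trino fault-tolerant execution.

    Assumptions: the classification names below match your EMR release, and
    `exchange_dir` points at storage the whole cluster can read and write.
    """
    if retry_policy not in ("QUERY", "TASK"):
        raise ValueError("retry_policy must be 'QUERY' or 'TASK'")
    return [
        {
            # Retry failed queries or tasks instead of failing outright.
            "Classification": "trino-config",
            "Properties": {"retry-policy": retry_policy},
        },
        {
            # Spool intermediate exchange data so retries can reuse it.
            "Classification": "trino-exchange-manager",
            "Properties": {"exchange.base-directories": exchange_dir},
        },
    ]


if __name__ == "__main__":
    # Hypothetical bucket path, replace with your own.
    print(json.dumps(trino_fte_config("s3://my-bucket/trino-exchange"), indent=2))
```

With retries enabled, work that was running on a node removed during scale-down can be rescheduled on the remaining workers rather than failing the query.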
Upvotes: 0