AWS Architecture: How to launch multiple & short lived EC2 instances and how to keep track of vCPU service quota / limit

Question

To conduct performance tests on different EC2 instance types I'd like to launch multiple ec2 instances for a short period of time. During the launch process I run a user_data (bash)-script to perform the measurements, store the result in a S3 bucket and shut the instance down.

My current approach:

Lambda function to get relevant Instance types
Lambda pushes instance types as individual messages to a SQS queue
The queue is configured to trigger a 2nd lambda function which launches for each message a instance with the user_data script to perform performance measurements

My problem: as the 2nd lambda function is processing the queue-messages and is spinning up new instances, it will hit the vCPU limit of my account. Because it may take up to 10min for each instance to complete the measurement, the retries also fail and the remaining messages end up the the DLQ.

Question: How can I launch new instances until the vCpu quota is reached and then spin up new ones (as running instances will shut down after the user_data script has finished). Probably I need to somehow keep track of my current vCPU usage/quota and invoke the lambda but was not able to come up with a good solution how to orchestrate the whole process (as I'm still a junior dev and fairly new to AWS).

Does anyone have a recommendation how to tackle that problem? any input is highly appreciated.

Thx a lot and BR!

AWS Architecture: How to launch multiple & short lived EC2 instances and how to keep track of vCPU service quota / limit

Answers (1)

Related Questions

AWS Architecture: How to launch multiple &amp; short lived EC2 instances and how to keep track of vCPU service quota / limit

Answers (1)

Related Questions

AWS Architecture: How to launch multiple & short lived EC2 instances and how to keep track of vCPU service quota / limit