What causes the Azure API Management time-out between request received and backend execution

Question

I have an Azure API Management resource (P2V2 tier) backed by Azure Functions running on an App Service plan (P2V2, to avoid cold starts). One of the services calling my API enforces a 1-second limit before it cancels the call.

When I trace these canceled calls in APIM's logs, I can see that we sometimes receive the request but do not trigger the Azure Function, i.e. the "backend" of the API, until 300-600 ms after receiving it (see image). This sometimes sets off a chain reaction that pushes the total execution time past 1 second.

Normal backend execution times vary between 30 and 60 ms, but there are outliers.
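To make the two numbers above easier to compare, here is a minimal sketch of how the gap before the backend call can be separated from the backend's own execution time. The `TraceTiming` record, its field names, and the sample values are hypothetical, chosen only for illustration; they are not an APIM log schema, and the 100 ms threshold is an arbitrary assumption.

```python
from dataclasses import dataclass


@dataclass
class TraceTiming:
    """Timestamps for one request, in ms since it was received (hypothetical shape)."""
    received_ms: float          # gateway received the request
    backend_started_ms: float   # gateway forwarded the request to the backend
    backend_finished_ms: float  # backend response arrived at the gateway


def pre_backend_delay_ms(t: TraceTiming) -> float:
    """Time spent between receiving the request and calling the backend."""
    return t.backend_started_ms - t.received_ms


def backend_duration_ms(t: TraceTiming) -> float:
    """Time the backend itself took to respond."""
    return t.backend_finished_ms - t.backend_started_ms


def slow_dispatches(timings, threshold_ms=100.0):
    """Return the requests whose pre-backend delay exceeds the threshold (assumed value)."""
    return [t for t in timings if pre_backend_delay_ms(t) > threshold_ms]


# Made-up sample values: one typical request and one delayed request.
samples = [
    TraceTiming(received_ms=0.0, backend_started_ms=5.0, backend_finished_ms=50.0),
    TraceTiming(received_ms=0.0, backend_started_ms=450.0, backend_finished_ms=500.0),
]

for t in slow_dispatches(samples):
    print(f"delay={pre_backend_delay_ms(t)} ms, backend={backend_duration_ms(t)} ms")
```

Splitting the timings this way shows whether a slow request spent its time waiting to be dispatched or executing in the Function, which are the two cases the question is trying to tell apart.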

What is causing this delay, and can I do something about it? Could it have something to do with the Function reaching its maximum number of instances (scale-out), and if so, how can I tell whether I need to scale out?
