Throughput calculation in Prometheus metrics from post_seconds_count, post_seconds_sum and post_seconds_max

Question

I was trying to build dashboard in Grafana for different API( java application). we started exporting the metrics to Prometheus by using these dependency.

  val prometheus_scdw = "io.prometheus" % "simpleclient_dropwizard" % "0.0.23"
  val prometheus_schs = "io.prometheus" % "simpleclient_hotspot" % "0.9.0"
  val prometheus_scg = "io.prometheus" % "simpleclient_guava" % "0.9.0"
 Metrics which we can see in exporter is like this( just for example): 
 # HELP controllers_autouserprofilecontroller_autologin_post_seconds_max  
 # TYPE controllers_autouserprofilecontroller_autologin_post_seconds_max gauge
 controllers_autouserprofilecontroller_autologin_post_seconds_max 0.075604753
 # HELP controllers_autouserprofilecontroller_autologin_post_seconds  
 # TYPE controllers_autouserprofilecontroller_autologin_post_seconds summary
controllers_autouserprofilecontroller_autologin_post_seconds_count 2529959.0
controllers_autouserprofilecontroller_autologin_post_seconds_sum 80214.121718928

I tried to see in GitHub to understand what exactly its means when they say count,sum or max but i didn't find any explanation. going with standard definition of these words like count is request severed, sum is time taken to served the request, max is highest time to served the request.

still wanted to ask if there is any better way or medium to understand these metrics.

I also used query for throughput for http_request_total to match the request counts in ALB monitoring which doesn't match. Query used: sum(increase(http_request_total[1m]))

Is there anything i am missing here or small percentage of mismatch is acceptable.

My target is to build kind of dashboard for API performance, given currently we are exporting mentioned metrics type for all the API.

Throughput calculation in Prometheus metrics from post_seconds_count, post_seconds_sum and post_seconds_max

Answers (1)

Related Questions