Prometheus stale metrics with remote-write-receiver vs active scraping

Question

I have a question about the different behaviors I observe when I use Prometheus only as a database (remote-write receiver enabled) versus as a metric collector (Prometheus actively scrapes an endpoint).

I have two dummy setups (as Docker containers):

1. Prometheus (v2.40.0) is configured to scrape the prometheus_exporter output of a Fluent Bit (v2.0.3) service.
2. Prometheus (v2.40.0) is started with the --enable-remote-write-receiver flag, and Fluent Bit (v2.0.3) writes the same data as in setup 1 to Prometheus's remote write endpoint.

When I stop Fluent Bit in setup 1 and plot the graph of a selected metric, the graph breaks at the point in time where Fluent Bit was stopped. In setup 2, however, the same actions result in Prometheus continuing to draw the graph, returning the last received value for 5 more minutes.

If I understand correctly, what happens in setup 2 is the expected behavior when a metric goes stale. However, as far as I understand, this should be the expected behavior in setup 1 as well, since I haven't changed query.lookback-delta in either setup.
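To make the two observations concrete, here is a rough Python model of how an instant query might pick a value. It is only a sketch, not Prometheus code: `instant_value`, `STALE`, and the sample timestamps are hypothetical names and numbers made up for illustration. It assumes that a failed scrape causes Prometheus to write a staleness marker for the series, while on the remote-write path no such marker is written unless the sender provides one. The 5-minute window is the default value of query.lookback-delta.

```python
LOOKBACK_DELTA = 5 * 60  # seconds; default of --query.lookback-delta
STALE = object()         # hypothetical stand-in for a staleness marker sample


def instant_value(samples, t, lookback=LOOKBACK_DELTA):
    """Return the value a query at time t would see, or None for a gap.

    samples is a list of (timestamp_seconds, value) pairs for one series.
    """
    # Only samples at or before the query time are candidates.
    candidates = [(ts, v) for ts, v in samples if ts <= t]
    if not candidates:
        return None
    ts, value = max(candidates, key=lambda s: s[0])  # newest candidate
    if value is STALE:
        return None  # series was explicitly marked stale: gap starts here
    if t - ts >= lookback:
        return None  # newest sample is too old: gap after the lookback window
    return value


# Setup 1 (assumed): scrapes at 0s, 15s, 30s, then the target disappears
# and a staleness marker is written at the next scrape time.
scraped = [(0, 1.0), (15, 1.0), (30, 1.0), (45, STALE)]

# Setup 2 (assumed): the same samples arrive via remote write, then nothing.
remote_written = [(0, 1.0), (15, 1.0), (30, 1.0)]

print(instant_value(scraped, 60))          # gap right after the stop
print(instant_value(remote_written, 60))   # last value still returned
print(instant_value(remote_written, 400))  # gap once 5 minutes have passed
```

If that assumption holds, the model reproduces both graphs: the scraped series breaks immediately, and the remote-written series keeps its last value until the lookback window runs out.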

I tried reading the documentation, but I cannot find a clear explanation for this difference, though this might be due to my lack of domain knowledge in Prometheus. :(

I would really appreciate it if anyone could help me understand the differences that might have caused these distinct behaviors. I'm sorry if this is a dumb question; I'm just getting acquainted with Prometheus.
