PromQL shows different results in two k8s clusters

Paul · August 23, 2023, 7:25am

I’ve been using Grafana Dashboards for some time to monitor K8’s cluster.
For this I use parts of the Grafana Dashboard page (Kubernetes cluster monitoring (via Prometheus)).

But I currently have different behavior on different K8s clusters, essentially it’s about this PromQL:

`sum (container_memory_working_set_bytes{id="/",kubernetes_io_hostname=~"^$Node$"}) / sum (machine_memory_bytes{kubernetes_io_hostname=~"^$Node$"}) * 100`

On one cluster I get a result the other reported: “N/A”

On the Prometheus GUI, I have already found out that the error is essentially related to the metric or the value {id=“/”}.

Here’s a test with the shortened PromQL:

container_memory_working_set_bytes{id="/"}

This PromQL shows me all nodes belonging to the cluster on one K8s cluster on the other cluster only an “Empty query result”.

Does anyone have any ideas what is going wrong here?

stuart · August 23, 2023, 7:43am

That suggest that no metrics are being recorded of that type for the other cluster. What does the targets page look like? Any errors shown, and does it have the scrape config for that metric source? How are you configuring your scrape jobs - static config, Prometheus Operator, Kubernetes service discovery with relabling, something else?

Paul · August 23, 2023, 8:09am

I use the promtheus operator (Quay)
Automation ensures that the same version is always present on each cluster.

My PromQL mentioned has also worked in the past and displayed data on each cluster.
Unfortunately, I can’t find that much online or in communities, which is usually the indicator that you’ve got yourself into a problem.

I also don’t know where to start looking, whether it’s related to an update from dockerd to containerd, or some RPM update in the RedHat environment on the individual nodes?
The Metric/Value {id=“/”} is somehow not clear to me, what exactly it does.

stuart · August 23, 2023, 8:37am

As mentioned take a look at the Prometheus targets page. Look for any errors and see if that job exists.

Paul · August 23, 2023, 9:14am

I had already done that, found no errors.

stuart · August 23, 2023, 9:36am

And the job that would generate these metrics was listed?

Paul · August 23, 2023, 12:02pm

Oh, did I look for errors in wrong place?
I checked for errors here: http://nodenameXYZ:9100/metrics

Basically, the current issue is only about the result of this PromQL on two different clusters - I’m attaching a screenshots here - ClusterA and B

stuart · August 23, 2023, 2:06pm

You want to be looking at the /targets page in Prometheus. Specifically look at the “kubelet” job.

Paul · August 23, 2023, 2:27pm

Ah, Thanks, got it - no errors here either.

stuart · August 23, 2023, 3:31pm

So you see a job “kubelet” listed with no errors? What targets does it have?

Paul · August 28, 2023, 7:31am

Yes, I see some jobs “kubelet”, all without errors.

It’s not that the metrics don’t deliver any values, they all work, only one cluster unfortunately delivers values that differ from the other clusters.
I just want to know why that is.

It would be interesting for me how “container_memory_working_set_bytes{id=”/“}” behaves on others.

Paul · September 5, 2023, 2:38pm

The bug was found.
Newer NodeExporters no longer show what is expected.

My “kube-prometheus-stack” is currently version 50.3.1 including node exporter version quay.io/prometheus/node-exporter:v1.5.0
Here the Prometheus GUI with the PromQL “container_memory_working_set_bytes{id=”/“}” unfortunately shows an “Empty query result”

Previously, the “kube-prometheus-stack” was used in version 34.9.0 - including the following node exporters:
quay.io/prometheus/node-exporter:v1.3.1 - I restored this version.
With this “kube-prometheus-stack” a correct result, at least what I want, is displayed with the PromQL specified above.

Topic		Replies	Views
Query error in Grafana for Prometheus PromQL	0	483	October 6, 2021
Prometheus for wide clusters about 1000 nodes General Help/Support	1	736	March 18, 2022
Simple metrics from prometheus_client_php all messed up when querying in Grafana General Help/Support	0	174	November 24, 2023
Remote Scraping K8 Cluster Prometheus server	0	115	September 24, 2024
Complicated rest api of prometheus not working PromQL	1	2493	May 27, 2021

PromQL shows different results in two k8s clusters

Related topics