How to recycle/delete inactive metrics automatically in the Prometheus client

JustCat · March 22, 2023, 11:19am

As the number of metrics grows, the resource consumption (mainly memory usage) of the Prometheus client grows with it. We would therefore like to extend the Prometheus client so that it recycles/deletes inactive metrics automatically.

The solution we currently propose is to find metrics that have been inactive for some period (measured either in clock time or as a count of scrapes with no new data), and then unregister the idle Collector from the CollectorRegistry.
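A minimal sketch of the clock-time variant of this idea, written in Python for illustration. It tracks when each label set was last updated and evicts the stale ones. The class name `IdleSeriesTracker` and its parameters are hypothetical, not part of any client library. The `remove` callback is an assumption about how eviction is wired up: with the Python client (`prometheus_client`) it could be a labelled metric's `remove(*labelvalues)` method, or a wrapper around `CollectorRegistry.unregister` when a whole collector should go.

```python
import threading
import time
from typing import Callable, Dict, List, Tuple

LabelValues = Tuple[str, ...]


class IdleSeriesTracker:
    """Hypothetical helper: remember when each label set was last updated
    and evict the ones that have been idle for longer than a TTL."""

    def __init__(
        self,
        remove: Callable[..., None],
        ttl_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._remove = remove      # assumed callback, e.g. some_gauge.remove
        self._ttl = ttl_seconds
        self._clock = clock        # injectable so the logic can be tested
        self._last_seen: Dict[LabelValues, float] = {}
        self._lock = threading.Lock()

    def touch(self, *labelvalues: str) -> None:
        """Call this every time the series for `labelvalues` is updated."""
        with self._lock:
            self._last_seen[labelvalues] = self._clock()

    def evict_idle(self) -> List[LabelValues]:
        """Remove every series not touched within the TTL.

        Returns the label sets that were removed."""
        with self._lock:
            now = self._clock()
            stale = [
                lv for lv, seen in self._last_seen.items()
                if now - seen > self._ttl
            ]
            for lv in stale:
                self._remove(*lv)
                del self._last_seen[lv]
            return stale
```

In use, the application would call `touch(...)` next to every metric update and run `evict_idle()` periodically, for example from a background timer. A removed series simply disappears from the next scrape, so the server will see it go stale rather than receive a final value.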

We are curious whether similar proposals or best practices exist in the Prometheus community. Is there a way to:

- automatically clean up idle metrics on the client side?
- set an upper limit on the number of metrics that an application client can generate/register?
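For the upper-limit question, one application-side approach is to cap the number of distinct label sets and fold everything beyond the cap into a single overflow series. The sketch below is only an illustration of that idea: `LabelSetLimiter`, `max_series`, and the `"other"` overflow value are hypothetical names, and nothing here relies on a built-in limit in any client library.

```python
import threading
from typing import Set, Tuple

LabelValues = Tuple[str, ...]


class LabelSetLimiter:
    """Hypothetical helper: admit at most `max_series` distinct label sets;
    any further label set is mapped to a fixed overflow label set."""

    def __init__(self, max_series: int, overflow: LabelValues) -> None:
        self._max = max_series
        self._overflow = overflow  # must have one value per label name
        self._seen: Set[LabelValues] = set()
        self._lock = threading.Lock()

    def admit(self, *labelvalues: str) -> LabelValues:
        """Return the label values that should actually be used."""
        with self._lock:
            if labelvalues in self._seen:
                return labelvalues
            if len(self._seen) < self._max:
                self._seen.add(labelvalues)
                return labelvalues
            return self._overflow
```

The caller would pass the result on to the metric, for example `gauge.labels(*limiter.admit(uri)).set(value)` with the Python client. The overflow series counts as one extra series on top of `max_series`, and the values folded into it lose their individual identity, which is the price of the bound.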

P.S. This topic might belong under “development”, but I am not sure about it.

stuart · March 22, 2023, 12:29pm

What do you mean by “idle metrics”?

JustCat · March 22, 2023, 1:27pm

Metrics that have not generated a new data point for a long period (e.g. over several queries or several days).

stuart · March 22, 2023, 2:31pm

There isn’t really such a concept in Prometheus. Every scrape (typically about once a minute) produces a new data point in the TSDB, even if the value hasn’t changed. That data point should be the “current” value of whatever the metric represents, such as the number of orders, CPU percentage, HTTP requests, etc.

One thing I’m wondering is if you are trying to use Prometheus to store events, which it isn’t designed for.

KevinKingKong · March 23, 2023, 1:58am

These metrics are sparse. For example, the gauge metric “HTTP request success rate” carries the URI as a tag. Some URIs are rarely accessed, but once one has been accessed, its series stays in memory and cannot be cleaned up automatically. The Prometheus server then keeps receiving the same value on every scrape.

stuart · March 27, 2023, 10:48am

It isn’t recommended to include URIs within labels, as that can lead to a cardinality explosion.
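One common way to keep a URI-like label while bounding its cardinality is to record a normalized route instead of the raw URI. The sketch below is an assumption about what such normalization could look like, not a recommendation from this thread: the function name `normalize_path` and the `:id` placeholder are hypothetical, and it only collapses purely numeric and UUID-shaped path segments. If the web framework exposes the matched route template, using that directly is usually the more reliable choice.

```python
import re

# Path segments that look like identifiers: all digits, or a UUID.
_ID_SEGMENT = re.compile(
    r"^(\d+|[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}"
    r"-[0-9a-fA-F]{4}-[0-9a-fA-F]{12})$"
)


def normalize_path(uri: str) -> str:
    """Drop the query string and fragment, then replace identifier-like
    path segments with a fixed placeholder so many URIs share one label."""
    path = uri.split("?", 1)[0].split("#", 1)[0]
    segments = [
        ":id" if _ID_SEGMENT.match(segment) else segment
        for segment in path.split("/")
    ]
    return "/".join(segments) or "/"
```

With this, `/orders/12345?expand=items` and `/orders/67890` both map to the label value `/orders/:id`, so the number of series depends on the number of routes rather than the number of distinct URIs.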

JustCat · March 28, 2023, 7:35am

Thank you for your prompt reply. We will take a closer look at our solution.
