Delete log streams via API #577
Let's try to match the Prometheus API here: https://prometheus.io/docs/prometheus/latest/querying/api/#delete-series
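For reference, the Prometheus delete-series endpoint linked above looks roughly like the sketch below. The host/port, matcher, and time range are all assumptions, and the admin API is disabled unless Prometheus is started with `--web.enable-admin-api`.

```shell
# Build a delete-series request against the Prometheus admin API.
# Assumed: Prometheus at localhost:9090, started with --web.enable-admin-api.
PROM_URL="http://localhost:9090"
MATCHER='{job="mongod"}'   # hypothetical series selector

# Print the command rather than running it; remove the leading 'echo'
# to actually send the request.
echo curl -X POST "${PROM_URL}/api/v1/admin/tsdb/delete_series" \
  --data-urlencode "match[]=${MATCHER}" \
  --data-urlencode "start=2021-01-01T00:00:00Z" \
  --data-urlencode "end=2021-01-02T00:00:00Z"
```

`--data-urlencode` handles the braces and quotes in the selector, so the matcher can be passed verbatim.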
We should start with the GET side: https://prometheus.io/docs/prometheus/latest/querying/api/#finding-series-by-label-matchers /cc @davkal
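The GET series-lookup endpoint referenced here can be sketched with curl as well; the host and the matcher are assumptions:

```shell
# List series matching a selector via the Prometheus query API.
PROM_URL="http://localhost:9090"   # assumed Prometheus address

# -G turns the --data-urlencode pairs into query-string parameters
# on a GET request. Printed here; drop 'echo' to execute.
echo curl -sG "${PROM_URL}/api/v1/series" \
  --data-urlencode 'match[]={app="mongod"}'
```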
Related to #113
This issue has been automatically marked as stale because it has not had any activity in the past 30 days. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.
I've hacked together a rough prototype of the delete API after seeing cortexproject/cortex#2103 and blowing up my index with an erroneously parsed timestamp. Is anyone on the Loki team working on the delete API? I could finish up my prototype over the next 2-3 weeks if needed.
Not yet. I appreciate your efforts! We would like to get it working for Cortex and then reuse most of the code in Loki.
My issue was unrelated: I accidentally parsed the "time since boot" counter from the Linux kernel log as a label. The Cortex PR allowed me to use the new
Is there any way to manually go in and delete all instances of a label? We're running into a scenario where we started collecting the wrong logs under the wrong label name; the Promtail reconfig was nice and easy, but now we have the old labels hanging around with invalid logs. We'd like to clean up, and since the API doesn't support this like Prometheus does yet, we're hoping there's a more manual way. We're using two different setups: one with filesystem and boltdb for chunks/index, the other with S3 and Cassandra. Both have the same scenario happening.
We need this too. It would be great to have something like Prometheus has. link
Do we have any update on this? Is the delete API available in Loki? For example:
It's actively being worked on by @sandeepsukhani. Feel free to give him support, it's open source!
@cyriltovena Hi! Thanks for the info! 💪 But while it's not done, do you know of a workaround that can be used? Since we hold our logs for many days, is there currently any way I can manually delete these label values?
I have to delete all data to solve the "cardinality limit exceeded for logs ... more than limit of 100000" problem, and I have no idea which label has high cardinality. Is there a tool to see this? I just noticed the index size growing fast these days, but I can't find the cause.
We recently released a way to see that with logcli (the --analyze-labels flag), see:
It does exactly what you want ✌️
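A sketch of the logcli usage being discussed, assuming logcli is installed and `LOKI_ADDR` points at your Loki instance; the address below is an assumption, and flag behavior may differ between logcli versions:

```shell
# Summarize label cardinality instead of printing raw streams.
export LOKI_ADDR="http://localhost:3100"   # assumed Loki address

# '{}' matches all streams; --analyze-labels reports, per label, how many
# unique values it has, which exposes high-cardinality labels.
CMD="logcli series '{}' --analyze-labels"
echo "$CMD"   # printed here; run the command directly against a live Loki
```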
@cyriltovena Thanks for the info. Are you aware of a way to actually delete the labels? I couldn't find information on that in the PR or blog. Thanks!
Same problem here; we need to clear logs due to a GDPR request. It seems silly to wipe all logs when there are only a few labels we need to clear.
On the topic of log deletion, I think it would be even more useful if we could configure retention, e.g. keep everything for X days/months/years. Or even better: since logs are usually stored on S3 (or similar storage) and Loki is (mostly?) append-only, maybe we could simply configure our S3 bucket to delete all files older than X days. That would also work nicely with protecting logs, since we could use e.g. S3 Object Lock to prevent deletion of logs until X days have passed. If you do add support for deleting streams via the API, don't enable it by default. For security, it's great that log shippers (anyone sending data to Loki) can't delete it. For retention, I think most setups will want to handle it in Loki or on the underlying storage (e.g. S3 bucket deletion rules).
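The S3-side retention described above can be sketched as a bucket lifecycle rule. The bucket name, empty prefix, and 90-day window below are all assumptions:

```shell
# Write a lifecycle rule that expires all objects after 90 days.
cat > lifecycle.json <<'EOF'
{
  "Rules": [
    {
      "ID": "expire-old-loki-data",
      "Status": "Enabled",
      "Filter": { "Prefix": "" },
      "Expiration": { "Days": 90 }
    }
  ]
}
EOF

# Apply it with the AWS CLI (command printed, not executed here):
echo aws s3api put-bucket-lifecycle-configuration \
  --bucket my-loki-bucket --lifecycle-configuration file://lifecycle.json
```

Note that lifecycle expiration applies to whole objects (chunk files), not individual streams, so it complements rather than replaces a delete API.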
I need this too. It's crazy how easy it is to add a bad label and have it totally break your entire log database, to the point where you need to wipe out all of your log data to repair it. This should be a pretty high-priority fix, imo.
Version 2.3.0 has this feature now. https://grafana.com/docs/loki/latest/operations/storage/logs-deletion/
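For readers landing here, the released deletion API looks roughly like the sketch below. The endpoint path and parameters follow the linked docs but may vary between Loki versions; the host, selector, and timestamps are assumptions, and deletion must be enabled in Loki's config (it is served by the compactor).

```shell
# Request deletion of log entries matching a selector in a time range.
LOKI_URL="http://localhost:3100"    # assumed compactor address
QUERY='{app="mongod"}'              # hypothetical streams to delete
START="1609459200"                  # unix seconds, assumed range
END="1609545600"

# Print the command rather than running it; remove 'echo' to execute.
# Multi-tenant setups also need an X-Scope-OrgID header.
echo curl -X POST "${LOKI_URL}/loki/api/v1/delete" \
  --data-urlencode "query=${QUERY}" \
  --data-urlencode "start=${START}" \
  --data-urlencode "end=${END}"
```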
Closing this issue now that the log deletion API has been released.
The documentation for the log deletion API has been moved here:
Is your feature request related to a problem? Please describe.
When users want to tweak the Promtail scrape configuration (relabeling, pipeline, discovery) to work with their own deployments, they'll probably generate some unused or unwanted Loki streams.
It would be nice if we could search/filter through streams, order them by last chunk received, throughput, and total size, and then delete those we don't want anymore.
This could also help users create some sort of cron job to apply retention. @sandlis
Describe the solution you'd like
1 - List all streams ordered by last data seen and throughput over the last 5 minutes. We should be able to provide a label matcher
{app="mongod", cluster="us-east1"}
to narrow down the search; this query should be limited to 100 results by default.
2 - Delete a list of streams via their labelset, and maybe from a specific time.
I feel like it could be useful to have some sort of stream management UI for this in Grafana @davkal?