Describe the bug
Discussed here:
https://confluentcommunity.slack.com/archives/C6UJNMY67/p1590998942212800
The logger factory io.confluent.common.logging.StructuredLoggerFactory keeps a ConcurrentHashMap of the loggers it has created and never clears it. In PullQueryExecutor, the query id is passed as part of the logger name, so every pull query produces a new entry. This causes the number of cached loggers to blow up as more queries come in.
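To make the shape of the leak concrete, here is a minimal sketch of the pattern (the class and method names below are simplified stand-ins; only StructuredLoggerFactory and PullQueryExecutor are taken from the issue): a factory caches loggers in a ConcurrentHashMap keyed by name, and the caller bakes a per-query id into that name, so the map gains one entry per query and never shrinks.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

// Simplified stand-in for the leaky pattern described above: loggers are
// cached by name, and the name includes a per-query id, so the cache only grows.
final class CachingLoggerFactory {
  private final ConcurrentMap<String, Logger> loggers = new ConcurrentHashMap<>();

  Logger getLogger(final String name) {
    // Entries are added on first use and never removed.
    return loggers.computeIfAbsent(name, LoggerFactory::getLogger);
  }
}

final class PullQueryCaller {
  private static final CachingLoggerFactory FACTORY = new CachingLoggerFactory();

  void execute(final String queryId) {
    // A fresh query id per request means a fresh cache entry per request,
    // so memory use grows with the number of pull queries ever executed.
    final Logger logger = FACTORY.getLogger("pull-query." + queryId);
    logger.info("executing pull query");
  }
}
```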
To Reproduce
Steps to reproduce the behavior:
On master, set the JVM heap to something small and generate a lot of pull queries. Users were able to hit this after about 30 minutes of high-QPS load.
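For reference, a rough load-generation sketch, assuming a local ksqlDB server on http://localhost:8088 started with a small heap (for example -Xmx256m) and a queryable table named USERS; the endpoint and payload follow the standard ksqlDB REST API, but the table and query are placeholders:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class PullQueryLoad {
  public static void main(final String[] args) throws Exception {
    final HttpClient client = HttpClient.newHttpClient();
    // Placeholder pull query; any queryable table works. Run the ksqlDB server
    // itself with a small heap (e.g. -Xmx256m) to hit the OOM sooner.
    final String body =
        "{\"ksql\": \"SELECT * FROM USERS WHERE USERID = 'user_1';\", \"streamsProperties\": {}}";
    final HttpRequest request = HttpRequest.newBuilder()
        .uri(URI.create("http://localhost:8088/query"))
        .header("Content-Type", "application/vnd.ksql.v1+json")
        .POST(HttpRequest.BodyPublishers.ofString(body))
        .build();
    // Each request is a distinct pull query, and each gets its own query id server-side.
    for (int i = 0; i < 1_000_000; i++) {
      client.send(request, HttpResponse.BodyHandlers.discarding());
    }
  }
}
```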
Expected behavior
No memory issues.
Screenshot from a user. It shows a persistent query (so it doesn't fully capture this case), but you can see that the query ids are part of the logger names and that they account for a lot of memory.
I ran a memory profile on this and confirmed that memory use keeps climbing with lots of pull queries; the source is io.confluent.common.logging.StructuredLoggerFactory.
The loggers were not written to be used this way, since the slf4j logger factory itself appears to keep references, by name, to every logger it hands out.
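A quick illustration of that behavior (assuming a logback binding; the per-query name is a placeholder): the underlying factory returns the same cached instance for a given name, so every unique per-query name stays referenced for the lifetime of the logger context.

```java
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

public class LoggerRegistryDemo {
  public static void main(final String[] args) {
    // With a logback binding, the LoggerContext keeps a registry of loggers
    // keyed by name and hands back the same instance on every call.
    final Logger first = LoggerFactory.getLogger("pull-query.query_123");
    final Logger second = LoggerFactory.getLogger("pull-query.query_123");
    System.out.println(first == second); // true: the instance is cached by name

    // So even if an outer ConcurrentHashMap were cleared, the backend's own
    // registry would still hold every per-query logger ever created.
  }
}
```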
I think the solution, and the intended use, is something more like an MDC (http://logback.qos.ch/manual/mdc.html). That way you could always attach the query id and other structured data without creating a new logger per query.
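A rough sketch of what that could look like with slf4j's MDC, using a single fixed logger and a hypothetical "query-id" key (the key name and surrounding method are illustrative, not the actual ksqlDB code):

```java
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import org.slf4j.MDC;

public class PullQueryLogging {
  // One fixed logger shared by all pull queries; no per-query logger name.
  private static final Logger LOG = LoggerFactory.getLogger(PullQueryLogging.class);

  void execute(final String queryId, final String statement) {
    // Attach the query id to the thread's logging context for this call only.
    try (MDC.MDCCloseable ignored = MDC.putCloseable("query-id", queryId)) {
      LOG.info("Executing pull query: {}", statement);
      // ... run the query ...
    } // the "query-id" entry is removed from the MDC automatically here
  }
}
```

With logback, the id can then be surfaced in every log line via %X{query-id} in the pattern layout, keeping the structured data without growing the logger registry.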