-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Traces not showing in GUI when viewing sessions with large no of traces #82
Comments
Good morning, @andyozj! I'm here to help you with the issue you're facing. Let's work together to find a solution. To address the issue of traces not showing in the GUI for sessions with a large number of traces, you can consider the following steps:
By implementing these optimizations and configurations, you should be able to improve the performance and resolve the timeout issues when viewing sessions with a large number of traces. To continue talking to Dosu, mention @dosu. Help Dosu learn! Give it feedback: Great Response | Irrelevant Answer | Incorrect Sources | Too Verbose | Hallucination | Bug Report | Other |
@andyozj Do you see whether any of the Langfuse Pods fail with an out of memory error or are being killed? |
@Steffen911, I noticed Langfuse Pods are being restarted, but not OOM caused. I've also tried increasing memory size of clickhouse to 16gb, to no avail but I'm out of ideas as to what could have caused it. |
@andyozj Can you connect to the ClickHouse database and check the error log at |
@Steffen911 there is no |
@andyozj Mh, can you see anything in the clickhouse container logs? |
@Steffen911 there's errors from the clickhouse container logs when I re-try to open the session:
|
@andyozj Can you estimate how many observations there are that relate to those 20000 sessions? And what's the total number of scores, traces, and observations within the system? We're not aware of major performance problems around that endpoint, but I see how your load pattern could cause them. Given the information above, I can try to reproduce them and see if there are further optimizations we can do in the queries. I can say ahead of time that this will be a bit more involved to fix. |
We have difficulties viewing 4 sessions primarily. These sessions have 67300, 25674, 25674 and 16000 traces respectively. I've done the query and receive the following results in the database:
Roger on the expected efforts required for this potential fix. Thank you very much! edit: For now, I've instructed my users to cap each observation to a total of 1 ~ 2 k traces for minimal stress during loading. |
Good Morning Developers!
I'm self-hosting langfuse v3 through helm chart v0.10.2.
Used the default helmchart dependencies for clickhouse, m`inio and valkey, but existing postgres set-up.
I've also increased all deployments/sts to reasonable resources such as 2cpu,8gb memory.
My developers team has submitted jobs of sessions with traces counts >20k, and have difficulty viewing them.
I've increased
NODE_OPTIONS
tovalue: --max-old-space-size=12288
, for both langfuse web/worker with 4cpu and 16gpu memory.The website will "stuck" at the following status for awhile before entering a "loading" state.

Upon monitoring the logs of the langfuse-web pod, I noticed that there's the following error:
Any suggestions/ potential resolutions will be good!
The text was updated successfully, but these errors were encountered: