-
Notifications
You must be signed in to change notification settings - Fork 255
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Polling Nvidia temperature keeps GPU awake #1291
Comments
This seems more like a bug - could you fill in the bug report form? |
That said, I could also look into adding GPU filtering, yes. Curious how that might look like though - would filtering by PCI info seem too confusing? |
Alternatively, I could filter by name + add options to disable any GPU activities for certain GPU names, in addition to more granular filtering for other widgets. Does the current dGPU show up by name in the temperatures tab? If you have a screenshot, that would be helpful. |
I like the idea. It should probably be done by index; to avoid device initialization by nvml's Alternatively a white list based approach could support uuid/pcie names pretty easily via Edit: Short term build without the gpu feature flag. PR 1276 should allow disabling of the gpu via config until filtering is done. This was probably introduced around 0.7.0 |
Some/all AMD GPUs are also affected. I have an RX580 that doesn't drive any monitors, and reading the hwmons wakes it up and keeps it awake. Unfortunately, it seems the The model is an ON/WARM/OFF state machine, where ON reads sensors and utilization, and transitions to WARM if utilization is 0 for some time, and WARM reads no sensors or util but transitions to OFF if the Theoretically you could also see Hmm... It seems that this should perhaps be fixed in the kernel. I have written a note to myself to report this to the hwmon mailing list/bug tracker. |
bottom already actually does a fairly simple check with |
If anyone can check, would be interested to see if a simple logic change in #1355 helps with it. |
@ClementTsang I've tried that branch, is it supposed to show Nvidia/GPU temps if it is already active? Currently it does not. |
The change would hide any entry for any device that's asleep; if it turns back on though in theory it should show up again... |
Mostly also just curious whether it stops the GPU from waking, or if there's more that I need to do in that part first. |
Seems like I don't. |
Hm, so the GPU is still waking up? |
Sorry mate. It looks like I had a brainfart.. The dgpu appears to not be waking. |
Just merged #1355, could you see in |
It doesn't wake it, but also does not show details if it is awake? It may be worth reading through this also https://gitlab.com/mission-center-devs/mission-center/-/issues/30#note_1697130114 |
Hmm... that's weird, thanks for the link. Also just curious, could you provide screenshots of what the temp table looks like on stable and on |
🤦 just realized that I never changed the sleep checks for nvidia GPUs... let me try looking at that too. |
Further information: reading |
Checklist
Describe the feature request
I noticed in a recent update the sensors tab (on linux) gained the dGPU temperature. On hybrid systems this is an issue as it causes the dGPU to stay awake and drain battery.
I can't see any easy option to disable this one sensor.
The text was updated successfully, but these errors were encountered: