scrape jobs never close connection #136

Closed
helgharbi opened this issue Dec 13, 2023 · 4 comments · Fixed by canonical/prometheus-hardware-exporter#64

@helgharbi

We received the error "Call to the Redfish API failed (RedfishCallFailed)".

The log shows:

INFO Login returned code 401: {"error":{"@Message.ExtendedInfo":[{"Message":"Unable to complete the operation because an invalid username and/or password is entered, and therefore authentication failed.","MessageArgs":[],"MessageArgs@odata.count":0,"MessageId":"IDRAC.2.5.SYS415","RelatedProperties":[],"RelatedProperties@odata.count":0,"Resolution":"Enter valid user name and password and retry the operation.","Severity":"Warning"}],"code":"Base.1.8.GeneralError","message":"A general error has occurred. See ExtendedInfo for more information"}}

After investigating, we found that Redfish connections to the iDRAC are never closed, so they accumulate and new connections are slow to establish.

racadm getssninfo
SSNID Type              User                                IP Address Login Date/Time    
---------------------------------------------------------------------------
249   REDFISH           root                               a.b.c.d 12/11/2023 11:40:25
250   REDFISH           root                               a.b.c.d 12/11/2023 11:40:33
254   REDFISH           root                               a.b.c.d 12/11/2023 11:46:25
255   REDFISH           root                               a.b.c.d 12/11/2023 11:46:33
257   REDFISH           root                               a.b.c.d 12/11/2023 11:47:37
258   REDFISH           root                               a.b.c.d 12/11/2023 11:47:39
261   REDFISH           root                               a.b.c.d 12/11/2023 11:53:24
262   REDFISH           root                               a.b.c.d 12/11/2023 11:53:31

We found a workaround, running racadm racreset soft, but the problem keeps coming back.
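
To watch the leak build up between resets, the session listing can be tallied per user and address. The following is a minimal sketch, assuming the output of racadm getssninfo has been captured as text in the column layout shown above; `count_sessions` and `sample` are illustrative names, not part of racadm or the exporter.

```python
from collections import Counter


def count_sessions(getssninfo_output: str, session_type: str = "REDFISH") -> Counter:
    """Tally open sessions of one type per (user, ip) from `racadm getssninfo` text."""
    counts = Counter()
    for line in getssninfo_output.splitlines():
        fields = line.split()
        # Data rows start with a numeric session ID; skip header and separator rows.
        if len(fields) < 4 or not fields[0].isdigit():
            continue
        if fields[1] == session_type:
            counts[(fields[2], fields[3])] += 1
    return counts


# Two rows copied from the listing in this report (IP address already redacted).
sample = """\
SSNID Type              User                                IP Address Login Date/Time
---------------------------------------------------------------------------
249   REDFISH           root                               a.b.c.d 12/11/2023 11:40:25
250   REDFISH           root                               a.b.c.d 12/11/2023 11:40:33
"""

print(count_sessions(sample))
```

A count that keeps growing for the same user and address between scrapes would be consistent with sessions not being closed.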

@Pjack added the bug label Dec 26, 2023
@Pjack added this to the 23.10.3 milestone Dec 26, 2023
@Pjack modified the milestones: 23.10.3, 23.10.4 Jan 30, 2024
@przemeklal
Member

przemeklal commented Feb 22, 2024

Possibly related to canonical/prometheus-hardware-exporter#48

@chanchiwai-ray self-assigned this Mar 4, 2024
@chanchiwai-ray
Contributor

chanchiwai-ray commented Mar 4, 2024

Are you able to provide more logs from the exporter service, or more information from the racadm CLI, to help us troubleshoot the issue? From the source code where the Redfish login and logout happen, we can't identify why this is happening; every login call is accompanied by a logout call...
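
One general way a login can end up without its matching logout is an exception raised between the two calls; whether that is what happens here is not established in this thread. As a minimal sketch of guarding against that case, assuming a client object that exposes `login()` and `logout()` (the `redfish_session` helper and `FakeClient` below are hypothetical and are not the exporter's code):

```python
from contextlib import contextmanager


@contextmanager
def redfish_session(client):
    """Log in on entry and always log out on exit, even if the scrape raises.

    `client` is any object exposing login() and logout(); this is an assumed
    interface for illustration only.
    """
    client.login()
    try:
        yield client
    finally:
        # Runs on normal exit and on exceptions, so the session is released.
        client.logout()


class FakeClient:
    """Stand-in for a real Redfish client so the sketch runs on its own."""

    def __init__(self):
        self.open_sessions = 0

    def login(self):
        self.open_sessions += 1

    def logout(self):
        self.open_sessions -= 1


client = FakeClient()
try:
    with redfish_session(client):
        raise RuntimeError("simulated failure mid-scrape")
except RuntimeError:
    pass

print(client.open_sessions)  # prints 0
```

If `login()` itself fails, no logout is attempted, since no session was established in that case.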

@przemeklal
Member

Shared internally.

@chanchiwai-ray
Contributor

The PR I linked could reduce the occurrence of this (or even fix the issue altogether) if it's caused by the case you shared.
