Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix accelerator device map #1913

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

vimalk78
Copy link
Collaborator

  • brings back .dockerignore file
  • fix the logic of adding accelerator device map
  • added few log statements

@vimalk78 vimalk78 requested a review from rootfs January 21, 2025 20:00
Copy link
Contributor

github-actions bot commented Jan 21, 2025

🤖 SeineSailor

Here is a concise summary of the pull request changes:

Summary: The "Fix accelerator device map" pull request refactors the accelerator device map logic, improving internal diagnostics and adding log statements for better visibility. Key modifications include corrections to device map addition logic and the introduction of the GetAllDevices() method to the Registry type.

Impact: These changes enhance the code's internal functionality without affecting the external interface or behavior, except for the added GetAllDevices() method. The readded .dockerignore file ensures proper Docker container setup.

Observations: The changes are focused on internal improvements, suggesting a refinement of the existing implementation rather than a significant architectural shift. The addition of log statements and diagnostics will aid in debugging and troubleshooting.

Suggestions: Consider adding tests to validate the corrected device map logic and the new GetAllDevices() method to ensure their correctness and reliability.

@rootfs
Copy link
Contributor

rootfs commented Jan 22, 2025

can you sign DCO?

@vimalk78
Copy link
Collaborator Author

can you sign DCO?

@rootfs signed DCO

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Adding .dockerignore will lead to incorrect version computation #1809

@@ -138,6 +139,7 @@ func (d *gpuDcgm) InitLib() (err error) {
if err != nil {
klog.Infof("There is no DCGM daemon running in the host: %s", err)
// embedded mode is not recommended for production per https://github.com/NVIDIA/dcgm-exporter/issues/22#issuecomment-1321521995
klog.Info("Attempting to inilialize dcgm in Embedded mode.")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit typo: initialize

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants