Update DCGM Persistent Tool to Utilize dcgm-exporter #2115

Closed
Maxusmusti opened this issue Feb 17, 2021 · 5 comments · Fixed by #2076
Labels: Agent, enhancement, tools (Of and related to the operation and behavior of various tools (iostat, sar, etc.))

Comments

@Maxusmusti
Member

This will let us retire the old Python 2 script, and it will also provide more metric options and significantly simpler metric customization.

More details at:
https://github.com/NVIDIA/gpu-monitoring-tools

@Maxusmusti Maxusmusti self-assigned this Feb 17, 2021
@portante portante added this to the v0.71 milestone Feb 18, 2021
@portante portante added tools Of and related to the operation and behavior of various tools (iostat, sar, etc.) enhancement labels Feb 18, 2021
@ashishkamra
Member

@Maxusmusti Any update on this?

@Maxusmusti
Member Author

Maxusmusti commented Mar 10, 2021

@ashishkamra This will be added either once PR #2076 is merged, or once PR 14 (in Peter's fork) for issue #2122 is merged on top of #2076 (currently complete, awaiting review). This work would also be done on top of the #2076 changes, and I am unsure which should take precedence: this issue or issue #2121.

@dagrayvid

@Maxusmusti Can you post an update here when (or if) a build is available for this?

@Maxusmusti
Member Author

@dagrayvid @ashishkamra The new dcgm-exporter update has been merged into PR #2076 (which itself will be merged into the main branch this week). I will update again once official rpm/image builds are available, but in the meantime, unofficial builds of the new update can be found at https://copr.fedorainfracloud.org/coprs/meyceoz/pbench-test and quay.io/meyceoz. Also, the official live-metric-visualizer and prom-graf-visualizer have already been updated to reflect the change to dcgm-exporter.

@Maxusmusti
Member Author

I forgot to update again a couple of weeks ago, but these changes are in the main branch now, so any builds off of the current main branch will include the update.

@portante portante linked a pull request Apr 8, 2021 that will close this issue