Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Testing: Temporary fix of gpu driver for Rocky Linux 8/9 #1522

Merged
merged 4 commits into from
Nov 23, 2023

Conversation

LujieDuan
Copy link
Contributor

@LujieDuan LujieDuan commented Nov 22, 2023

Description

The NVML and DCGM integration tests are failing on Rocky Linux 8 and 9: with Rocky Linux 9.3 and 8.9 released, the current GCE images can no longer install the matching kernel-devel packages.

The fix here is to override the system repos to use the repo of the matching release version. The fix should be removed once new images become available; the fix only targets 9.2 and 8.8 images so should not break the tests when new images are available.

Related issue

b/312455563

How has this been tested?

Integration tests passing on RL 8/9 for NVML/DCGM.

Checklist:

  • Unit tests
    • Unit tests do not apply.
    • Unit tests have been added/modified and passed for this PR.
  • Integration tests
    • Integration tests do not apply.
    • Integration tests have been added/modified and passed for this PR.
  • Documentation
    • This PR introduces no user visible changes.
    • This PR introduces user visible changes and the corresponding documentation change has been made.
  • Minor version bump
    • This PR introduces no new features.
    • This PR introduces new features, and there is a separate PR to bump the minor version since the last release already.
    • This PR bumps the version.

@LujieDuan LujieDuan marked this pull request as ready for review November 22, 2023 23:00
@LujieDuan LujieDuan force-pushed the lujieduan-dcgm-rocky-fix branch from 788bf2a to 7ab194e Compare November 23, 2023 18:19
@LujieDuan LujieDuan requested review from a team, sophieyfang and ridwanmsharif and removed request for a team and sophieyfang November 23, 2023 19:16
Copy link
Contributor

@ridwanmsharif ridwanmsharif left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a nit and a question. Feel free to merge once addressed

@LujieDuan LujieDuan merged commit c3f452a into master Nov 23, 2023
9 checks passed
@LujieDuan LujieDuan deleted the lujieduan-dcgm-rocky-fix branch November 23, 2023 21:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants