[UR][CUDA] Remove NVML xfails after driver update #21625

Draft
kekaczma wants to merge 1 commit into sycl from fix-cuda-nvml-xfails

Remove xfails from 4 CUDA conformance tests that require NVML:

  • SuccessThrottleReasons (UR_DEVICE_INFO_CURRENT_CLOCK_THROTTLE_REASONS)
  • SuccessFanSpeed (UR_DEVICE_INFO_FAN_SPEED)
  • SuccessMaxPowerLimit (UR_DEVICE_INFO_MAX_POWER_LIMIT)
  • SuccessMinPowerLimit (UR_DEVICE_INFO_MIN_POWER_LIMIT)

These tests were failing with 'Driver/library version mismatch' because the libnvidia-ml.so in the container (550.144) was incompatible with the NVIDIA driver installed on the CI host.

Now that the CI infrastructure has been updated to driver version 550.144, matching the library in the container, these tests should pass.
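
To make the failure mode concrete, here is a minimal sketch of a pre-flight check that compares the version of the user-space NVML library with the version of the host kernel driver. It is not part of this PR or of the CI scripts. The helper names are hypothetical, and the text layout assumed for `/proc/driver/nvidia/version` is an assumption that may not hold for every driver flavour.

```python
import re


def parse_kernel_module_version(proc_text):
    """Extract the driver version from text shaped like /proc/driver/nvidia/version.

    Assumed layout (not guaranteed for every driver build):
    'NVRM version: NVIDIA UNIX x86_64 Kernel Module  <version>  <build date>'
    Returns None when the text does not match that layout.
    """
    match = re.search(r"Kernel Module\s+(\d+(?:\.\d+)+)", proc_text)
    return match.group(1) if match else None


def versions_match(library_version, driver_version):
    """Return True only when both versions are known and identical.

    A differing or unknown version is treated as a mismatch, since that is
    the situation in which NVML reports 'Driver/library version mismatch'.
    """
    if library_version is None or driver_version is None:
        return False
    return library_version.strip() == driver_version.strip()
```

A CI job could run a check like this before the conformance suite and skip or flag the NVML-dependent tests on a mismatch, rather than relying on xfails.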
