Conversation

@GrigoryEvko
Contributor

Description

Fixes runtime library loading failures when building with CUDA 13 by replacing hardcoded CUDA 12 references with dynamic version detection.

Related to #26516 which updates CUDA 13 build pipelines, but this PR fixes the Python runtime code that was still hardcoded to CUDA 12.

Problem

The build system correctly detects CUDA 13 via CMake, but the runtime Python code had CUDA 12 hardcoded in multiple locations, causing "CUDA 12 not found" errors on CUDA 13 systems.

Solution

Modified onnxruntime/__init__.py and setup.py to dynamically use the detected CUDA version instead of hardcoded "12" strings.

Changes

  • Dynamic CUDA version extraction from build info
  • Library paths now use f-strings with cuda_major_version
  • Added CUDA 13 support to extras_require and dependency exclusions
  • Fixed TensorRT RTX package to use correct CUDA version
  • Updated version validation to accept CUDA 12+
  • Fixed PyTorch compatibility checks to compare versions dynamically

Impact

  • CUDA 13 builds now load correct libraries
  • Backward compatible with CUDA 12
  • Forward compatible with future CUDA versions

Testing

Verified with CUDA 13.0 build that library paths resolve correctly and preload_dlls() loads CUDA 13 libraries without errors.

The runtime Python code had CUDA 12 hardcoded in multiple locations,
causing library loading failures when building with CUDA 13.

Changes in onnxruntime/__init__.py:
- Dynamic library path generation using detected CUDA version
- Updated version validation to accept CUDA 12+
- Fixed PyTorch compatibility check to compare CUDA versions dynamically
- Updated diagnostic checks to report correct nvidia packages
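The version-validation and PyTorch-compatibility items above can be sketched like this. Both function names and the exact comparison logic are assumptions for illustration; only the intent (accept CUDA 12+, compare majors dynamically rather than against a literal "12") comes from the PR.

```python
from typing import Optional


def cuda_version_supported(cuda_version: str) -> bool:
    """Accept CUDA 12 and anything newer, instead of requiring exactly 12."""
    return int(cuda_version.split(".")[0]) >= 12


def torch_cuda_matches(build_cuda: str, torch_cuda: Optional[str]) -> bool:
    """Compare CUDA major versions dynamically (torch reports e.g. '12.8')."""
    if torch_cuda is None:  # CPU-only PyTorch build
        return False
    return build_cuda.split(".")[0] == torch_cuda.split(".")[0]
```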

Changes in setup.py:
- Added is_cuda_version_13 flag and proper version parsing
- Added CUDA 13 library versions to dependency exclusion list
- Added CUDA 13 extras_require with nvidia-*-cu13 packages
- Fixed TensorRT RTX to use dynamic CUDA version

Fixes runtime "CUDA 12 not found" errors on CUDA 13 systems while
maintaining backward compatibility with CUDA 12 builds.
@GrigoryEvko
Contributor Author

@microsoft-github-policy-service agree

- Extract CUDA major version parsing into _extract_cuda_major_version() helper
- Add _get_cufft_version() for CUDA-version-specific cufft mapping (11 for CUDA 12.x, 12 for CUDA 13.x)
- Replace all try-except-pass blocks with contextlib.suppress() (fixes RUFF/SIM105)
- Apply dynamic cufft version to both Windows and Linux DLL paths
- Add contextlib import to setup.py and onnxruntime/__init__.py

This eliminates code duplication and ensures correct library versions across CUDA versions.
…if blocks

- Replace separate is_cuda_version_12/13 conditions with single dynamic block
- Use cuda_major_version variable with f-strings for package names
- Apply correct cufft version mapping (11.0 for CUDA 12, 12.0 for CUDA 13)
- Works automatically for future CUDA versions
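The deduplicated setup.py block described above might look roughly like this. `cuda_major_version` is the variable named in the commit; the package list and the way the version is detected are illustrative, not the actual setup.py contents.

```python
cuda_major_version = 13  # in the real setup.py this is parsed from build info

# cuFFT wheel major per the commit: 11.0 for CUDA 12, 12.0 for CUDA 13.
cufft_major = 11 if cuda_major_version == 12 else 12

# One dynamic block replaces the separate is_cuda_version_12/13 branches;
# f-strings produce nvidia-*-cu12 or nvidia-*-cu13 names automatically.
extras_require = {
    "cuda": [
        f"nvidia-cuda-runtime-cu{cuda_major_version}",
        f"nvidia-cublas-cu{cuda_major_version}",
        f"nvidia-cufft-cu{cuda_major_version}",
    ],
}
```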
@GrigoryEvko GrigoryEvko requested a review from xadupre November 7, 2025 21:33
@tianleiwu
Contributor

@GrigoryEvko
Contributor Author

Maybe this way? Deduplicated some code.

@GrigoryEvko GrigoryEvko requested a review from tianleiwu November 8, 2025 23:15
@tianleiwu
Contributor

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows x64 QNN CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 4 pipeline(s).

@tianleiwu tianleiwu merged commit 3c5f177 into microsoft:main Nov 9, 2025
90 checks passed