ONNX Runtime CUDA Builds

qgemm: optimize avxvnni QGEMM inner kernel for M=1 #8130

Re-run triggered November 14, 2025 20:58

#22952

r-devulap:avxvnni-unroll

Status Success

Total duration 2h 19m 22s

Artifacts 1

windows_cuda.yml

on: pull_request

Windows GPU CUDA CI Pipeline

Windows GPU CUDA CI Pipeline Test Job

Annotations

6 warnings

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1234

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1227

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1220

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1213

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1206

epilog offset from end of function exceeds 4095

Windows GPU CUDA CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1199

epilog offset from end of function exceeds 4095

Artifacts

Produced during runtime

Name	Size	Digest
build-artifacts	2.02 GB	`sha256:bb655f7826dc74355195960301b029bfc5347d7a869410cb7e8026bd4697300f`
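
The digest above can be used to check a downloaded copy of the artifact. The following is a minimal sketch, assuming the digest covers the artifact archive as downloaded; the helper names and the local filename `build-artifacts.zip` are hypothetical and do not come from the pipeline itself.

```python
import hashlib


def sha256_digest(path, chunk_size=1 << 20):
    """Return the file's digest in the 'sha256:<hex>' form shown in the table."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so a multi-gigabyte artifact is never held in memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return "sha256:" + h.hexdigest()


def matches_digest(path, expected):
    """Compare a local file against an expected 'sha256:<hex>' digest string."""
    return sha256_digest(path) == expected.strip().lower()


if __name__ == "__main__":
    # Hypothetical local filename; the digest string is the one listed for this run.
    expected = "sha256:bb655f7826dc74355195960301b029bfc5347d7a869410cb7e8026bd4697300f"
    try:
        print(matches_digest("build-artifacts.zip", expected))
    except FileNotFoundError:
        print("artifact not downloaded")
```

A mismatch would suggest a truncated or altered download, or that the digest was computed over a different representation of the artifact than the file being checked.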