Skip to content

Attenion(23) CUDA #8198

Attenion(23) CUDA

Attenion(23) CUDA #8198

Triggered via pull request November 19, 2025 03:38
Status Failure
Total duration 34m 9s
Artifacts

windows_tensorrt.yml

on: pull_request
Windows GPU TensorRT CI Pipeline
30m 7s
Windows GPU TensorRT CI Pipeline
Windows GPU TensorRT CI Pipeline Test Job
0s
Windows GPU TensorRT CI Pipeline Test Job
Fit to window
Zoom out
Zoom in

Annotations

10 errors and 6 warnings
Windows GPU TensorRT CI Pipeline: onnxruntime/core/providers/cuda/llm/attention.cc#L96
'QKMatMulOutputMode': is not a class or namespace name
Windows GPU TensorRT CI Pipeline: onnxruntime/core/providers/cuda/llm/attention.cc#L65
syntax error: missing ';' before identifier 'parameters'
Windows GPU TensorRT CI Pipeline: onnxruntime/core/providers/cuda/llm/attention.cc#L65
'AttentionParameters': undeclared identifier
Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1234
epilog offset from end of function exceeds 4095
Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1227
epilog offset from end of function exceeds 4095
Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1220
epilog offset from end of function exceeds 4095
Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1213
epilog offset from end of function exceeds 4095
Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1206
epilog offset from end of function exceeds 4095
Windows GPU TensorRT CI Pipeline: onnxruntime/core/mlas/lib/amd64/QgemmU8X8KernelAvx2.asm#L1199
epilog offset from end of function exceeds 4095