Add CUDA memory optimization for long-context GQA attention #7853
This workflow is awaiting approval from a maintainer in #26658
Triggered via pull request
November 28, 2025 05:59
Status: Action required
web.yml
on: pull_request
precheck
wasm_Release_static_library / build-wasm
web_Debug / build_onnxruntime_web
web_Release / build_onnxruntime_web