Add CUDA memory optimization for long-context GQA attention #7309

This workflow is awaiting approval from a maintainer in #26658
Triggered via pull request November 28, 2025 05:59
Status Action required

linux_openvino_ci.yml

on: pull_request
Build and Test OpenVINO EP (AlamLinux8, Py3.12)  /  build_test_pipeline