flash attention FA4 blackwell on sm120? #10564
voipmonitor started this conversation in General
Replies: 1 comment
only sm100
Hello,
is the new FA4 compatible with sm120 (RTX 6000 PRO, 5090 etc.) ?
This is the PR: #9928
```shell
python3 -m sglang.launch_server \
    --model-path nvidia/DeepSeek-V3-0324-FP4 \
    --tp 4 \
    --attention-backend trtllm_mla \
    --moe-runner-backend flashinfer_trtllm \
    --quantization modelopt_fp4 \
    --speculative-algorithm EAGLE \
    --speculative-num-steps 3 \
    --speculative-eagle-topk 1 \
    --speculative-num-draft-tokens 4 \
    --prefill-attention-backend fa4 \
    --speculative-attention-mode decode
```
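For context on the "only sm100" answer: both sm100 (B200/GB200) and sm120 (RTX PRO 6000, RTX 5090) are Blackwell-generation architectures, but they report different CUDA compute capabilities, (10, 0) vs. (12, 0), and kernels built for one do not automatically run on the other. A minimal sketch of checking your GPU against a backend's supported list (the `sm_arch`/`is_supported` helpers are hypothetical, not part of SGLang; with PyTorch installed you would feed in `torch.cuda.get_device_capability()`):

```python
# Hypothetical helpers (not SGLang API): map a CUDA compute capability
# tuple to its "sm" architecture name and compare it against the
# architectures a kernel backend supports.

def sm_arch(capability: tuple[int, int]) -> str:
    """(10, 0) -> 'sm100' (B200/GB200); (12, 0) -> 'sm120' (RTX 5090 etc.)."""
    major, minor = capability
    return f"sm{major}{minor}"

def is_supported(capability: tuple[int, int],
                 supported: tuple[str, ...] = ("sm100",)) -> bool:
    """True if this GPU's architecture is in the backend's supported set."""
    return sm_arch(capability) in supported

# With torch: is_supported(torch.cuda.get_device_capability())
print(sm_arch((10, 0)))       # -> sm100
print(sm_arch((12, 0)))       # -> sm120
print(is_supported((12, 0)))  # -> False, matching the "only sm100" answer
```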