vllm
[Attention] relax the head dim 512 and paged kv for sm90+FA4
#38835
Merged

[Attention] relax the head dim 512 and paged kv for sm90+FA4 #38835

IwakuraRein
IwakuraRein IwakuraRein requested a review from LucasWilkinson LucasWilkinson 38 days ago
IwakuraRein IwakuraRein requested a review from MatthewBonanni MatthewBonanni 38 days ago
mergify mergify added v1
IwakuraRein IwakuraRein changed the title relax the head dim 512 and paged kv for sm90+FA4 [Attention] relax the head dim 512 and paged kv for sm90+FA4 38 days ago
gemini-code-assist
gemini-code-assist commented on 2026-04-02
mergify mergify added ci/build
IwakuraRein relax the head dim 512 and paged kv for sm90+FA4
f38f4fde
mergify
mergify mergify added needs-rebase
IwakuraRein update the vllm-flash-attn commit
75ea3b36
IwakuraRein IwakuraRein force pushed to 75ea3b36 38 days ago
mergify mergify removed needs-rebase
IwakuraRein update vllm-flash-attn git repo; TODO: revert after the vllm-flash-at…
5527ed33
LucasWilkinson Advertise head_dim 512 support when FA4 is available and force FA4 fo…
694930b5
IwakuraRein point vllm-flash-attn to ToT
b04cdd20
LucasWilkinson
LucasWilkinson approved these changes on 2026-04-03
LucasWilkinson LucasWilkinson added ready
LucasWilkinson LucasWilkinson enabled auto-merge (squash) 36 days ago
LucasWilkinson Merge branch 'main' into update-sm90-fa4
6e1b13ad
IwakuraRein Merge branch 'main' into update-sm90-fa4
1752db0a
IwakuraRein Merge branch 'main' into update-sm90-fa4
0a82e28c
IwakuraRein Merge branch 'main' into update-sm90-fa4
6a52051c
IwakuraRein Merge branch 'main' into update-sm90-fa4
ad526440
LucasWilkinson LucasWilkinson merged e24e0a43 into main 32 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone