[Attention] relax the head dim 512 and paged kv for sm90+FA4 #38835
IwakuraRein
changed the title relax the head dim 512 and paged kv for sm90+FA4 [Attention] relax the head dim 512 and paged kv for sm90+FA4 38 days ago
relax the head dim 512 and paged kv for sm90+FA4
f38f4fde
update the vllm-flash-attn commit
75ea3b36
update vllm-flash-attn git repo; TODO: revert after the vllm-flash-at…
5527ed33
Advertise head_dim 512 support when FA4 is available and force FA4 fo…
694930b5
point vllm-flash-attn to ToT
b04cdd20
Merge branch 'main' into update-sm90-fa4
6e1b13ad
Merge branch 'main' into update-sm90-fa4
1752db0a
Merge branch 'main' into update-sm90-fa4
0a82e28c
Merge branch 'main' into update-sm90-fa4
6a52051c
Merge branch 'main' into update-sm90-fa4
ad526440
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub