vllm
e24e0a43 - [Attention] relax the head dim 512 and paged kv for sm90+FA4 (#38835)

Commit
20 days ago
[Attention] relax the head dim 512 and paged kv for sm90+FA4 (#38835) Signed-off-by: Siyuan Fu <siyuanf@nvidia.com> Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
Author
Parents
Loading