vllm
e24e0a43
- [Attention] relax the head dim 512 and paged kv for sm90+FA4 (#38835)
Commit
20 days ago
[Attention] relax the head dim 512 and paged kv for sm90+FA4 (#38835)

Signed-off-by: Siyuan Fu <siyuanf@nvidia.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
References
#38835 - [Attention] relax the head dim 512 and paged kv for sm90+FA4
Author
IwakuraRein
Parents
b55d830e