vllm
cf93c1a1
- [ROCm][AITER] Fix aiter paged_attention_v1 decode for sliding window and head_size < 64 (#34570)
Commit
4 days ago
[ROCm][AITER] Fix aiter paged_attention_v1 decode for sliding window and head_size < 64 (#34570)

Signed-off-by: Andreas Karatzas <akaratza@amd.com>
References
#34570 - [ROCm][AITER] Fix aiter paged_attention_v1 decode for sliding window and head_size < 64
Author
AndreasKaratzas
Parents
89358f0d