vllm
ee659e3b
- [Bugfix][ROCm] Use `chunked_prefill_paged_decode` as fallback for V1 attention on ROCm (#18093)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
216 days ago
[Bugfix][ROCm] Use `chunked_prefill_paged_decode` as fallback for V1 attention on ROCm (#18093) Signed-off-by: kf <kuanfu.liu@embeddedllm.com>
References
#18093 - [Bugfix][ROCm] Use `chunked_prefill_paged_decode` as fallback for V1 attention on ROCm
Author
kliuae
Parents
4e1c6a02
Loading