vllm
ee659e3b - [Bugfix][ROCm] Use `chunked_prefill_paged_decode` as fallback for V1 attention on ROCm (#18093)

Commit
216 days ago
[Bugfix][ROCm] Use `chunked_prefill_paged_decode` as fallback for V1 attention on ROCm (#18093) Signed-off-by: kf <kuanfu.liu@embeddedllm.com>
Author
Parents
Loading