vllm
1d35662e - [ROCm] Disable chunked prefill/prefix caching when running MLA on non-cuda platforms (#13844)

Commit
297 days ago
[ROCm] Disable chunked prefill/prefix caching when running MLA on non-cuda platforms (#13844) Signed-off-by: Sage Moore <sage@neuralmagic.com>
Author
Parents
Loading