vllm
1d35662e
- [ROCm] Disable chunked prefill/prefix caching when running MLA on non-cuda platforms (#13844)
Commit
297 days ago
[ROCm] Disable chunked prefill/prefix caching when running MLA on non-cuda platforms (#13844)

Signed-off-by: Sage Moore <sage@neuralmagic.com>
References
#13844 - [ROCm] Disable chunked prefill/prefix caching when running MLA on non-cuda platforms
Author
SageMoore
Parents
e656f638