vllm
d9f83d62
- [ROCm] Enable chunked prefill/paged attention in MLA on ROCm (#14316)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
276 days ago
[ROCm] Enable chunked prefill/paged attention in MLA on ROCm (#14316) Signed-off-by: Sage Moore <sage@neuralmagic.com>
References
#14316 - [ROCm] Enable chunked prefill/paged attention in MLA on ROCm
Author
SageMoore
Parents
4a754fcf
Loading