vllm
f1531d9f
- [Hybrid] Mamba2 prefix cache blocks freeing for running requests (#28047)
Commit
54 days ago
[Hybrid] Mamba2 prefix cache blocks freeing for running requests (#28047)

Signed-off-by: Stanislaw Wozniak <stw@zurich.ibm.com>
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
References
#28047 - [Hybrid] Mamba2 prefix cache blocks freeing for running requests
Author
s3woz
Parents
2d6001f4