vllm
d8bebb00
- Add tests for chunked prefill and prefix cache with causal pooling models (#26526)
Commit
110 days ago
Add tests for chunked prefill and prefix cache with causal pooling models (#26526)

Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Co-authored-by: Ayush Singh <ayush1009208@gmail.com>
References
#26526 - Add tests for chunked prefill and prefix cache with causal pooling models
Author
maxdebayser
Parents
35bc22f2