vllm
64e3d67a
- Enable Cross layers KV cache layout at NIXL Connector (#30207)
Commit
88 days ago
Enable Cross layers KV cache layout at NIXL Connector (#30207)

Signed-off-by: Liran Schour <lirans@il.ibm.com>
Signed-off-by: liranschour <liranschour@users.noreply.github.com>
Co-authored-by: Or Ozeri <or@ozery.com>
References
#30207 - Enable Cross layers KV cache layout at NIXL Connector
Author
liranschour
Parents
098b2d66