vllm
d765cf01 - [Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711)

Commit
161 days ago
[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711) Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com> Signed-off-by: Roger Wang <hey@rogerw.io> Co-authored-by: knlnguyen1802 <knlnguyen1802@gmail.com> Co-authored-by: Roger Wang <hey@rogerw.io>
Author
Parents
Loading