vllm
653124bd - [Frontend] Add chunked processing to handle long inputs in embedding models (#22280)

Commit
146 days ago
[Frontend] Add chunked processing to handle long inputs in embedding models (#22280) Signed-off-by: x22x22 <wadeking@qq.com> Signed-off-by: Kdump <rootshellexp@gmail.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Maximilien de Bayser <maxdebayser@gmail.com> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
Author
Parents
Loading