vllm
43936849 - [BugFix] Fix PP/async scheduling with pooling models (#28899)

Commit
22 days ago
[BugFix] Fix PP/async scheduling with pooling models (#28899)

Signed-off-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>