vllm
7c94ae16 - [BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102)

Commit
26 days ago
[BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102)

Signed-off-by: triangle14 <y1019026570@gmail.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>