vllm
7c94ae16
- [BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
26 days ago
[BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102) Signed-off-by: triangle14 <y1019026570@gmail.com> Signed-off-by: mgoin <mgoin64@gmail.com> Co-authored-by: mgoin <mgoin64@gmail.com>
References
#39102 - [BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service
Author
triangleXIV
Parents
ad05edfb
Loading