vllm
8ee90c83
- Add `--max-model-len auto` to auto-fit context to available memory (#29431)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
131 days ago
Add `--max-model-len auto` to auto-fit context to available memory (#29431) Signed-off-by: mgoin <mgoin64@gmail.com>
References
#29431 - Add `--max-model-len auto` to auto-fit context to available memory
Author
mgoin
Parents
d7e05ac7
Loading