vllm
8ee90c83 - Add `--max-model-len auto` to auto-fit context to available memory (#29431)

Commit
131 days ago
Add `--max-model-len auto` to auto-fit context to available memory (#29431) Signed-off-by: mgoin <mgoin64@gmail.com>
Author
Parents
Loading