mgoin
changed the title Auto-fit max_model_len Add `--max-model-len -1` to auto-fit context length to GPU memory167 days ago
mgoin
changed the title Add `--max-model-len -1` to auto-fit context length to GPU memory Add `--max-model-len -1` to auto-fit context to available memory167 days ago
mgoin
changed the title Add `--max-model-len -1` to auto-fit context to available memory Add `--max-model-len auto` to auto-fit context to available memory167 days ago
Login to write a write a comment.
Login via GitHub