llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization #16653
JohannesGaessler
changed the title llama: automatically fit parameters not set by the user to free device memory llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization 87 days ago
CISC
commented
on 2025-12-04
ggerganov
approved these changes
on 2025-12-10
llama: automatically fit args to free memory
532be323
fix CI
8a2f3f50
hints for bug reports, ensure no reallocation
f6987d4d
fix segfault with Vulkan
c93ed495
add llama-fit-params to CI
4eac35d5
fix CI
90ba3378
fix CI
17739af3
fix CI
f4986bf7
minor adjustments
9d0a0bb4
fix assignment of 1 dense layer
97820aa7
fix logger not being reset on model load failure
c963908c
remove --n-gpu-layer hint on model load failure
7dcabc65
fix llama-fit-params verbosity
9faca5a9
ggerganov
approved these changes
on 2025-12-14
fix edge case
ae534ec0
fix typo [no ci]
c1bb7c03
Assignees
No one assigned
Labels
examples
devops
ggml
Login to write a write a comment.
Login via GitHub