llama.cpp
b1f3a6e5 - llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)

Commit
12 days ago
* llama: automatically fit args to free memory
  llama-fit-params tool
* fix CI
* hints for bug reports, ensure no reallocation
* fix segfault with Vulkan
* add llama-fit-params to CI
* fix CI
* fix CI
* fix CI
* minor adjustments
* fix assignment of 1 dense layer
* fix logger not being reset on model load failure
* remove --n-gpu-layer hint on model load failure
* fix llama-fit-params verbosity
* fix edge case
* fix typo [no ci]