llama.cpp
196f5083 - common : more accurate sampling timing (#17382)

Commit
111 days ago
common : more accurate sampling timing (#17382) * common : more accurate sampling timing * eval-callback : minor fixes * cont : add time_meas impl * cont : fix log msg [no ci] * cont : fix multiple definitions of time_meas * llama-cli : exclude chat template init from time measurement * cont : print percentage of unaccounted time * cont : do not reset timings
Author
Parents
Loading