llama.cpp
3015851c - llama : add getters for n_threads/n_threads_batch (#7464)

Commit

1 year ago

llama : add getters for n_threads/n_threads_batch (#7464) * llama : add getters for n_threads/n_threads_batch This commit adds two new functions to the llama API. The functions can be used to get the number of threads used for generating a single token and the number of threads used for prompt and batch processing (multiple tokens). The motivation for this is that we want to be able to get the number of threads that the a context is using. The main use case is for a testing/verification that the number of threads is set correctly. Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com> * squash! llama : add getters for n_threads/n_threads_batch Rename the getters to llama_n_threads and llama_n_threads_batch. Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com> --------- Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com>

References

#7464 - llama : add getters for n_threads/n_threads_batch

Author

danbev

Parents

55ac3b7a

llama.cpp 3015851c - llama : add getters for n_threads/n_threads_batch (#7464)

llama.cpp
3015851c - llama : add getters for n_threads/n_threads_batch (#7464)