llama.cpp
f5a77a62 - Introduce C-style API (#370)

Commit
2 years ago
Introduce C-style API (#370) * Major refactoring - introduce C-style API * Clean up * Add <cassert> * Add <iterator> * Add <algorithm> .... * Fix timing reporting and accumulation * Measure eval time only for single-token calls * Change llama_tokenize return meaning
Author
Parents
Loading