llama.cpp
63fd76fb - Reduce model loading time (#43)

Commit
2 years ago
Reduce model loading time (#43) * Use buffering * Use vector * Minor --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Parents
Loading