llama.cpp
63fd76fb
- Reduce model loading time (#43)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Reduce model loading time (#43) * Use buffering * Use vector * Minor --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
References
#43 - Reduce model loading time
Author
maekawatoshiki
Parents
2a20f48e
Loading