llama.cpp
2777a84b
- llama : quantize up to 31% faster on Linux and Windows with mmap (#3206)
Commit
2 years ago
llama : quantize up to 31% faster on Linux and Windows with mmap (#3206)

* llama : enable mmap in quantize on Linux -> 31% faster
* also enable mmap on Windows

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
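
For readers unfamiliar with the technique named in the commit message, the following is a minimal, generic sketch of reading a file through a memory mapping instead of copying it into a buffer first. It is not the llama.cpp change itself and does not reproduce the 31% figure; the function names `sum_bytes_read` and `sum_bytes_mmap` and the chunk size are hypothetical, chosen only for illustration.

```python
import mmap


def sum_bytes_read(path):
    """Baseline: copy the whole file into a bytes object, then process it."""
    with open(path, "rb") as f:
        data = f.read()  # one full copy of the file into process memory
    return sum(data)


def sum_bytes_mmap(path, chunk=1 << 16):
    """Process the file through a read-only memory mapping.

    The OS pages data in on demand, so the file is never read into one
    large buffer up front. Note: mapping an empty file raises ValueError.
    """
    total = 0
    with open(path, "rb") as f:
        # Length 0 maps the whole file; ACCESS_READ works on Linux and Windows.
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            for off in range(0, len(mm), chunk):
                # Slicing the mapping returns bytes for just this window.
                total += sum(mm[off:off + chunk])
    return total
```

Both functions return the same result for the same file; only the way the bytes reach the process differs. Any actual speedup depends on file size, page cache state, and platform.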
References
#3206 - llama : quantize up to 31% faster on Linux with mmap
Author
cebtenzzre
Parents
0a4a4a09