llama.cpp
2777a84b - llama : quantize up to 31% faster on Linux and Windows with mmap (#3206)

Commit
2 years ago
llama : quantize up to 31% faster on Linux and Windows with mmap (#3206) * llama : enable mmap in quantize on Linux -> 31% faster * also enable mmap on Windows --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Author
Parents
Loading