llama.cpp
cb40dfca - llama : only use Q6_K for output weights if tensor size is multiple of 256 (#1932)

Commit
2 years ago
llama : only use Q6_K for output weights if tensor size is multiple of 256 (#1932)

* Only use Q6_K for output weights if tensor size is multiple of 256
* Fixed copy/paste mistake

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
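
The rule in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: `kSuperBlock`, `QuantType`, and `pick_output_type` are hypothetical names, and the sketch assumes the multiple-of-256 check is applied to both tensor dimensions. The value 256 corresponds to the super-block size of the k-quant formats.

```cpp
#include <cstdint>

// Super-block size of the k-quant formats (assumed to be 256 here).
constexpr std::int64_t kSuperBlock = 256;

// Hypothetical stand-ins for the quantization types involved.
enum class QuantType {
    Requested,  // whatever type the caller originally asked for
    Q6_K        // the higher-precision type preferred for output weights
};

// Picks the type for the output weights: Q6_K only when both dimensions
// divide evenly into super-blocks, otherwise the requested type is kept.
constexpr QuantType pick_output_type(std::int64_t nx, std::int64_t ny) {
    return (nx % kSuperBlock == 0 && ny % kSuperBlock == 0)
               ? QuantType::Q6_K
               : QuantType::Requested;
}
```

Under these assumptions, a 4096 x 32000 output tensor would be promoted to Q6_K, while a 4096 x 32001 tensor would keep the requested type because 32001 is not a multiple of 256.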