llama.cpp
cb40dfca
- llama : only use Q6_K for output weights if tensor size is multiple of 256 (#1932)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
llama : only use Q6_K for output weights if tensor size is multiple of 256 (#1932) * Only use Q6_K for output weights if tensor size is multiple of 256 * Fixed copy/paste mistake --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
References
#1932 - Only use Q6_K for output weights if tensor size is multiple of 256
Author
ikawrakow
Parents
ca7c3f4d
Loading