llama.cpp 7883796f - quantize: be able to specify the output tensor type
Commit
1 year ago
References
#6239 - quantize: be able to explicitly specify quantization type of output and token embedding tensors
Author
Iwan Kawrakow
Parents
b2075fd6