llama.cpp
0e826d12
- quantize: be able to specify the token embedding tensor type
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
quantize: be able to specify the token embedding tensor type
References
ik/quantize_not_repeating
#6239 - quantize: be able to explicitly specify quantization type of output and token embedding tensors
Author
Iwan Kawrakow
Parents
7883796f
Loading