llama.cpp
1d0331c1 - quantize: options for output and token embedding tensors qtype (#6239)

Commit
1 year ago
quantize: options for output and token embedding tensors qtype (#6239) * quantize: be able to specify the output tensor type * quantize: be able to specify the token embedding tensor type --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Author
Parents
Loading