llama.cpp
0e826d12 - quantize: be able to specify the token embedding tensor type

Commit

1 year ago

quantize: be able to specify the token embedding tensor type

References

ik/quantize_not_repeating

#6239 - quantize: be able to explicitly specify quantization type of output and token embedding tensors

Author

Iwan Kawrakow

Parents

Loading