llama.cpp
7883796f - quantize: be able to specify the output tensor type

Commit

2 years ago

quantize: be able to specify the output tensor type

References

#6239 - quantize: be able to explicitly specify quantization type of output and token embedding tensors

Author

Iwan Kawrakow

Parents

Loading