llama.cpp
cd4a7c4c - Make quantize_row_iq4_nl do the same thing as quantization on CUDA

Commit

1 year ago

Make quantize_row_iq4_nl do the same thing as quantization on CUDA

References

#6196 - Make IQ4_NL quantization be the same on CPU/CUDA/Metal when quantizing K-cache

Author

Iwan Kawrakow
