llama.cpp
IQ4_NL: 4-bit non-linear quants with blocks of 32
#5590
Merged

Loading