llama.cpp
CUDA: MMQ support for iq4_nl, iq4_xs
#8278
Merged
