llama.cpp
CUDA: MMQ code deduplication + iquant support
#8495
Merged

Loading