llama.cpp
ggml-cuda: Add generic NVFP4 MMQ kernel
#21074
Merged

Loading