llama.cpp
ggml webgpu: faster normal quant and some k-quant matrix operations, better shader parameter handling
#20173
Merged

Loading