llama.cpp
opencl: tiled mul_mat with local memory for f16 and f32
#14809
Merged

Loading