llama.cpp
4cb208c9 - vulkan: coopmat2 mul_mat optimizations (#14934)

Commit
36 days ago
vulkan: coopmat2 mul_mat optimizations (#14934) - Increase tile size for k-quants, to match non-k-quants - Choose more carefully between large and medium tiles, considering how it interacts with split_k - Allow larger/non-power of two split_k, and make the splits a multiple of 256 - Use split_k==3 to when >1/2 and <=2/3 of the SMs would hae been used
Author
Parents
Loading