llama.cpp
466300fe - vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206)

Commit
267 days ago
vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206)

Do masking on whole dwords, fetch all scales at once.
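
The "masking on whole dwords" idea can be illustrated outside the shader. The following is a minimal C sketch, not the commit's actual code (the real change is in the Vulkan shaders). It assumes 4-bit values packed two per byte, and all function names are hypothetical. It shows that one 32-bit AND does the work of four per-byte ANDs.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical baseline: mask each of the four packed bytes separately. */
static void low_nibbles_bytewise(const uint8_t q[4], uint8_t out[4]) {
    for (int i = 0; i < 4; ++i) {
        out[i] = (uint8_t)(q[i] & 0x0F);
    }
}

/* Hypothetical dword variant: one mask extracts the low nibble of all four bytes. */
static uint32_t low_nibbles_dword(uint32_t packed) {
    return packed & 0x0F0F0F0Fu;
}

/* One shift plus one mask extracts the high nibble of all four bytes. */
static uint32_t high_nibbles_dword(uint32_t packed) {
    return (packed >> 4) & 0x0F0F0F0Fu;
}

/* Checks that both approaches agree for a given dword.
   The mask is identical in every byte, so the result does not depend on endianness. */
static int nibble_paths_agree(uint32_t packed) {
    uint8_t in[4], out[4];
    uint32_t repacked;
    memcpy(in, &packed, sizeof in);
    low_nibbles_bytewise(in, out);
    memcpy(&repacked, out, sizeof repacked);
    assert(repacked == low_nibbles_dword(packed));
    return repacked == low_nibbles_dword(packed);
}
```

The dword form trades four narrow operations for one wide one, which is presumably the kind of saving the commit message refers to. How the shader then consumes the masked values is not shown here.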