llama.cpp
3f81b4e9 - vulkan: support GET_ROWS for k-quants (#16235)

Commit
12 days ago
vulkan: support GET_ROWS for k-quants (#16235) The dequantize functions are copy/pasted from mul_mm_funcs.comp with very few changes - add a_offset and divide iqs by 2. It's probably possible to call these functions from mul_mm_funcs and avoid the duplication, but I didn't go that far in this change.
Author
Parents
Loading