whisper.cpp
97bd65f9 - vulkan: support GET_ROWS for k-quants (llama/16235)

Commit
253 days ago
vulkan: support GET_ROWS for k-quants (llama/16235) The dequantize functions are copy/pasted from mul_mm_funcs.comp with very few changes - add a_offset and divide iqs by 2. It's probably possible to call these functions from mul_mm_funcs and avoid the duplication, but I didn't go that far in this change.
Author
Committer
Parents
Loading