llama.cpp
fbeda900 - vulkan: matmul dequantization improvements (#12015)

Commit
320 days ago
vulkan: matmul dequantization improvements (#12015) * faster dequant for old quants * dont use unpack for iq4_nl * vec2 unpack for q8
Author
Parents
Loading