llama.cpp
fbeda900
- vulkan: matmul dequantization improvements (#12015)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
320 days ago
vulkan: matmul dequantization improvements (#12015) * faster dequant for old quants * dont use unpack for iq4_nl * vec2 unpack for q8
References
#12015 - vulkan: matmul dequantization improvements
Author
netrunnereve
Parents
581650b7
Loading