llama.cpp
466300fe
- vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
267 days ago
vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206) Do masking on whole dwords, fetch all scales at once.
References
#11206 - vulkan: optimize coopmat2 q4_k/q5_k dequant functions.
Author
jeffbolznv
Parents
206bc534
Loading