llama.cpp
58d07a80
- metal : copy kernels for quant to F32/F16 conversions (#12017)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
346 days ago
metal : copy kernels for quant to F32/F16 conversions (#12017) metal: use dequantize_q templates --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
References
#12017 - metal: Copy kernels for quant to F32 conversions (#10976).
Author
gcp
Parents
34a846b5
Loading