llama.cpp
d67777c2
- metal : add Q8_0 support (#2763)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
1 year ago
metal : add Q8_0 support (#2763) * metal : add dequantize_q8_0 kernel * metal : add mul_mat_q8_0_f32 kernel * metal : add Q8_0 mul_mm kernel
References
#2763 - metal : add Q8_0 support
Author
ggerganov
Parents
c3e53b42
Files
2
ggml-metal.m
ggml-metal.metal
Loading