metal : add Q8_0 support #2763
metal : add dequantize_q8_0 kernel
46a0881c
metal : add mul_mat_q8_0_f32 kernel
61c8259a
metal : add Q8_0 mul_mm kernel
1202e06c
ggerganov
marked this pull request as ready for review 2 years ago
lshzh-ww
approved these changes
on 2023-08-24
ggerganov
merged
d67777c2
into master 2 years ago
ggerganov
deleted the metal-add-q8_0 branch 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub