llama.cpp
≈65% speedup of the AVX-512 implementation of `ggml_vec_dot_q4_0()`
#933
Merged


dfyz merged 1 commit into ggml-org:master from dfyz:master
dfyz closed this 2 years ago
dfyz force pushed to 0e07e6a8 2 years ago
dfyz reopened this 2 years ago
ggerganov added the `performance` label
ggerganov added the `high priority` label
ggerganov approved these changes on 2023-04-15
dfyz force pushed from d787348b 2 years ago
dfyz force pushed to 6a4fa4d9 2 years ago
dfyz merged f266259a into master 2 years ago
Commit 6a4fa4d9 (dfyz): Speedup the AVX-512 implementation of `ggml_vec_dot_q4_0()`

