llama.cpp
add avx2 for dot_q8_0_q8_0, 2x faster than scalar
#1211
Merged

add avx2 for dot_q8_0_q8_0, 2x faster than scalar #1211

YannFollet
YannFollet add avx2 for dot_q8_0_q8_0, 2x faster than scalar
e309138d
sw
sw approved these changes on 2023-04-28
sw sw merged 04aaae1d into master 2 years ago
YannFollet YannFollet deleted the dot_q8_0_q8_0_avx2 branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone