llama.cpp
IQ4_NL sgemm + Q4_0 AVX optimization
#9422
Merged

ggerganov merged 4 commits into ggml-org:master from avx_optimizations
netrunnereve
netrunnereve squashed
6b780d82
netrunnereve shuffle
a201c6b5
netrunnereve remove f16c iq4_nl as i cant make it faster than before
a753b259
netrunnereve Merge branch 'ggerganov:master' into avx_optimizations
d635c75b
github-actions added the ggml label
ggerganov approved these changes on 2024-09-13
ggerganov merged 5c3d0f18 into master 1 year ago
netrunnereve deleted the avx_optimizations branch 1 year ago
