PR #9422 IQ4_NL sgemm + Q4_0 AVX optimization

squashed

netrunnereve committed 1 year ago

shuffle

netrunnereve committed 1 year ago

remove f16c iq4_nl as i cant make it faster than before

netrunnereve committed 1 year ago

Merge branch 'ggerganov:master' into avx_optimizations

netrunnereve committed 1 year ago

llama.cpp IQ4_NL sgemm + Q4_0 AVX optimization #9422 Merged