llama.cpp
IQ4_NL sgemm + Q4_0 AVX optimization
#9422
Merged

Commits
  • squashed
    netrunnereve committed 1 year ago
  • shuffle
    netrunnereve committed 1 year ago
  • remove f16c iq4_nl as i cant make it faster than before
    netrunnereve committed 1 year ago
  • Merge branch 'ggerganov:master' into avx_optimizations
    netrunnereve committed 1 year ago
Loading