llama.cpp
669696e0 - ggml-cpu: ARM64: repack version of q8_0 (dotprod and i8mm) (#18096)

Commit
30 days ago
ggml-cpu: ARM64: repack version of q8_0 (dotprod and i8mm) (#18096) * wip: skeleton for q8_0 repack * q8_0 repack GEMV implementations * GEMM implementations * Formatting * Fixed format consistency of repack gemm and gemv declarations * gemv and gemm generic location consistent with declarations * Removed non-correct unused variables statements * Cleanup, consistent style * Missing generic fallbacks for x86 and powerpc
Author
Parents
Loading