ggml-cpu: FA add GEMM microkernel #19422
ggml-cpu: FA add GEMM microkernel
c8b9c839
add guard for sizeless vector types
a1e1420b
fix case where DV % GGML_F32_EPR != 0
734f76fb
am17an force pushed from c4c451a7 to 734f76fb 100 days ago
move memset out of the loop
8debab31
ggerganov approved these changes on 2026-02-13
move another memset out of the loop
9c660dda
use RM=4 for arm
8d1be6c4
simd_gemm: convert everything to int
d473b671
am17an force pushed from 1b44835c to d473b671 99 days ago
convert everything to size_t to avoid warnings
c34b1a48
am17an
force pushed
from
27aa9286
to
c34b1a48
99 days ago
fixup
6aababd0
add pragma for ignoring aggressive loop optimizations
9de8ba9c
am17an merged 684b3610 into master 97 days ago
am17an
deleted the opt-fa-micro-gemm branch 97 days ago