[x86] matmulnbit x64 kernel for 8bits #24491
liqunfu dismissed these changes on 2025-04-21
fajin-corp dismissed their stale review via 33af3895 319 days ago
added quant8 interface
75d4ab55
added q8 packb and blocksum
d5ef23ec
added interface for sq8 int8 matmul
71d1f1cd
fix prepack stride
caf4737c
finished q8 matmul m2 n4
ced7c66f
finished Q8Int8GemmR2xC1BlkLen16Avx2
fc3e9799
finished Q8Int8GemmR2xC1BlkLen16Avx2
517b9891
finished block16 avx2/vnni
e5da9d40
finished Q8Int8GemmR2xC4BlkLen32Avx2
d270008a
finished Q8Int8GemmR2xC1BlkLen32Avx2
4d45bd84
finished Q8Int8GemmR1xC4BlkLen32Avx2
13097f0b
finished Q8Int8GemmR1xC1BlkLen32Avx2
1287666a
finished block64 avx2/vnni
8e24a764
added avx512/vnni kernel interface
04071b30
finished Q8Int8GemmR2xC4BlkLen16Avx512
23cb557a
finished MlasQ8Int8GemmKernelBlkLen16Avx512
1548d95d
finished MlasQ8Int8GemmKernelBlkLen32Avx512
d0df455d
finished MlasQ8Int8GemmKernelBlkLen64Avx512
d7ae11bc
finished MlasQ8Int8GemmKernelBlkLen128Avx512
702962fe
fixed 512 vnni build
30118126
added prepack ut
02f74e44
added avx flags
89a38948
finished ut
6c64086f
passed prepack ut
d17c840c
fixing gemm ut
4cdf43bd
passed gemm vnni ut
b3ec1648
fixed ut
8d5abe11
added benchmark
f99ebffc
fixed avx2vnni ut
f11de12a
debugging ut
1cb47c8e
passed matmulnbits op ut
d6a15050
fix linting
8ab12ccf
try to fix ci
a9f186af
fajin-corp force pushed from 066e0731 to a9f186af 319 days ago
fix gcc pragma
cb4398d6
resolving ut build error
ac17924a
fix ut size
ede8f759
fix linting
a5332fd8
reduce test count
809e8dbc
configure cpu ep for ut
367c5685
fix dml and webgpu error
30cbf729
fix coreml ut
daf30649
fix linting
5e973df8
reduce test count
159f7f1c
update comments
d99c6f68
liqunfu approved these changes on 2025-04-24
fajin-corp deleted the fajin/matmul8bit_x64_kernel branch 317 days ago
snnn removed release:1.22.0
Assignees
No one assigned