onnxruntime
0b90363a - [MLAS][AArch64] SQ4BitGemm CompInt8 multi-block implementation (#19826)

Commit
1 year ago
Update the SQ4BitGemm CompInt8 implementation to process multiple blocks along a single column instead of processing single blocks from multiple columns.
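
The change in traversal order can be illustrated with a minimal scalar sketch. This is not the MLAS kernel: the function names, the column-major block layout of B, and the per-block float scales are all assumptions made for illustration, the 4-bit values are assumed to be already unpacked to int8, and the two-blocks-per-iteration step is an arbitrary choice. The first function visits one block from every column per step; the second walks several blocks down a single column before moving to the next column.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Integer dot product of one block, then scaled by the two per-block scales.
static float ScaledBlockDot(const int8_t* a, const int8_t* b, size_t blk_len,
                            float a_scale, float b_scale)
{
    int32_t sum = 0;
    for (size_t i = 0; i < blk_len; ++i) {
        sum += static_cast<int32_t>(a[i]) * static_cast<int32_t>(b[i]);
    }
    return static_cast<float>(sum) * a_scale * b_scale;
}

// Assumed layout (hypothetical): B is column-major, so block `blk` of column `n`
// starts at b + (n * block_count + blk) * blk_len, with scale b_scales[n * block_count + blk].

// Single block from multiple columns: for each block index along K,
// visit that block in every column.
std::vector<float> SingleBlockMultiColumn(const int8_t* a, const float* a_scales,
                                          const int8_t* b, const float* b_scales,
                                          size_t n_cols, size_t block_count, size_t blk_len)
{
    assert(blk_len > 0);
    std::vector<float> c(n_cols, 0.0f);
    for (size_t blk = 0; blk < block_count; ++blk) {
        for (size_t n = 0; n < n_cols; ++n) {
            const size_t b_blk = n * block_count + blk;
            c[n] += ScaledBlockDot(a + blk * blk_len, b + b_blk * blk_len, blk_len,
                                   a_scales[blk], b_scales[b_blk]);
        }
    }
    return c;
}

// Multiple blocks along a single column: for each column, walk K two blocks
// at a time with independent accumulators, then handle a leftover block.
std::vector<float> MultiBlockSingleColumn(const int8_t* a, const float* a_scales,
                                          const int8_t* b, const float* b_scales,
                                          size_t n_cols, size_t block_count, size_t blk_len)
{
    assert(blk_len > 0);
    std::vector<float> c(n_cols, 0.0f);
    for (size_t n = 0; n < n_cols; ++n) {
        const int8_t* b_col = b + n * block_count * blk_len;
        const float* s_col = b_scales + n * block_count;
        float acc0 = 0.0f;
        float acc1 = 0.0f;
        size_t blk = 0;
        for (; blk + 1 < block_count; blk += 2) {
            acc0 += ScaledBlockDot(a + blk * blk_len, b_col + blk * blk_len, blk_len,
                                   a_scales[blk], s_col[blk]);
            acc1 += ScaledBlockDot(a + (blk + 1) * blk_len, b_col + (blk + 1) * blk_len, blk_len,
                                   a_scales[blk + 1], s_col[blk + 1]);
        }
        if (blk < block_count) {  // odd block count: one trailing block
            acc0 += ScaledBlockDot(a + blk * blk_len, b_col + blk * blk_len, blk_len,
                                   a_scales[blk], s_col[blk]);
        }
        c[n] = acc0 + acc1;
    }
    return c;
}
```

Both functions compute the same row of the output and differ only in the order in which blocks are visited. Because the partial sums are accumulated in a different order, floating-point results may differ in the last bits for general scales.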