onnxruntime
150c4cb8 - [MLAS AArch64] SQNBitGemm CompInt8 kernel (#18953)

Commit
1 year ago
[MLAS AArch64] SQNBitGemm CompInt8 kernel (#18953) Implement ARM NEON SQNBitGemm kernel that first block quantizes A to int8 and then does int8 multiplication.
Author
Parents
Loading