onnxruntime
0b90363a
- [MLAS][AArch64] SQ4BitGemm CompInt8 multi-block implementation (#19826)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
[MLAS][AArch64] SQ4BitGemm CompInt8 multi-block implementation (#19826) Update SQ4BitGemm CompInt8 implementation to process multiple blocks along a single column instead of processing single blocks from multiple columns.
References
#19826 - [MLAS][AArch64] SQ4BitGemm CompInt8 multi-block implementation
Author
edgchen1
Parents
226f60f2
Loading