onnxruntime
5d8c5409 - POWER10: QGEMM optimization (#10642)

Commit
4 years ago
POWER10: QGEMM optimization (#10642) * POWER10: QGEMM optimization This patch makes use of POWER10 MMA feature for QGEMM function. This optimization includes signed and unsigned cases.Tested and there are no new failures with gcc11 and clang-14. * Changes as per review comments Co-authored-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>
Author
Parents
Loading