onnxruntime
67093b20 - Clean up aarch64 quantized GEMM dispatch (#16120)

Commit
2 years ago
Clean up aarch64 quantized GEMM dispatch (#16120) ### Description - Add a new field to `MLAS_PLATFORM` for S8S8 GEMM dispatch. - Set this field to either dot product instructions or NEON MLA in platform.cpp. - Clean up dispatch selector in qgemm.h. ### Motivation and Context This will allow future extensibility as other functions that use other ARM64 extensions for quantized matrix multiplication. --------- Co-authored-by: Skand Hurkat <skhurkat@microsoft.com>
Author
Parents
Loading