onnxruntime
43142a82 - [Nuphar] added Gemm-to-MatMul conversion in model editor (#4691)

Commit

5 years ago

[Nuphar] added Gemm-to-MatMul conversion in model editor (#4691) * [Nuphar] added Gemm-to-MatMul conversion in model editor * added a mode gemm_to_matmul that turns Gemm Ops into MatMul Ops * enabled model_quantizer to quantize MatMul inside a Loop op * this PR also included Gemm-11 support from Ke Deng * Fixed a couple of existing bugs Fixed a couple of old bugs exposed by the newly-added tests and the support of Gemm-11, including: * correctly handle aliasing among states and outputs in Scan * fixed a transpose issue in building tvm IR for MatMul * fixed an issue related to generating IR for computing Gemm alpha * disabled several tests that triggered some deep issue (likely) in the graph partitioner. I think it might be better to have a separate PR to address the issue.

References

#4691 - [Nuphar] added Gemm-to-MatMul conversion in model editor

Author

yangchen-MS

Parents

5c5efa90

onnxruntime 43142a82 - [Nuphar] added Gemm-to-MatMul conversion in model editor (#4691)

onnxruntime
43142a82 - [Nuphar] added Gemm-to-MatMul conversion in model editor (#4691)