onnxruntime
MatMulNBits + Add fusion
#20587
Merged

MatMulNBits + Add fusion #20587

edgchen1 merged 30 commits into main from edgchen1/matmul_nbits_bias
edgchen1
edgchen1 save work
350f3e20
edgchen1 handle bias in fallback impl
752b7c95
edgchen1 reorder includes
4121ff44
edgchen1 save work - initial impl of optimizer
96fa18b4
edgchen1 Merge remote-tracking branch 'origin/main' into edgchen1/matmul_nbits…
11c3b922
edgchen1 Add test, refine impl.
baab944a
edgchen1 Merge remote-tracking branch 'origin/main' into edgchen1/matmul_nbits…
15687585
edgchen1 fix fusion, update test
6402ea5d
edgchen1 Enable bias in SQNBitGemm benchmark.
6a1feb28
edgchen1 clean up
a6c42e41
edgchen1 move test functions around to avoid unused function warnings
7fa290b7
edgchen1 add /bigobj for graph_tarnsform_test.cc
b5057eb4
edgchen1 lint
823c9db2
edgchen1 Add Neural Speed build to post-merge-jobs.yml.
7555f92f
edgchen1 move graph_utils::IsSupportedProvider() in header so it is available …
10ba1db0
edgchen1 put MatMulNBitsBiasFusion test in contrib ops ifdef
471429c3
edgchen1 update ContribOperators.md
7f07d93c
edgchen1 call RunWithConfig instead of Run
97a15a12
edgchen1 update OperatorKernels.md
905cbd22
edgchen1 add matmul_nbits_fusion files to extended minimal build source
0abc2cd7
edgchen1 Merge remote-tracking branch 'origin/main' into edgchen1/matmul_nbits…
1a4b5a0f
edgchen1 Make DML MatMulNBits input check accomodate new inputs.
50cb266d
edgchen1 expect input arg counts of 0 for missing optional inputs
131141f3
edgchen1 Merge branch 'edgchen1/matmul_nbits_bias' of github.com:microsoft/onn…
2417b802
edgchen1 add runtime optimization test for matmulnbits and add fusion
1d9c2f50
edgchen1 edgchen1 marked this pull request as ready for review 2 years ago
edgchen1 edgchen1 requested a review 2 years ago
github-advanced-security
github-advanced-security commented on 2024-05-10
github-advanced-security
github-advanced-security commented on 2024-05-10
edgchen1 edgchen1 requested a review from skottmckay skottmckay 2 years ago
edgchen1 edgchen1 requested a review from liqunfu liqunfu 2 years ago
edgchen1 edgchen1 requested a review from yufenglee yufenglee 2 years ago
edgchen1 add test onnx file
afacb57c
edgchen1 lint
55851e33
github-advanced-security
github-advanced-security commented on 2024-05-10
yufenglee
yufenglee commented on 2024-05-14
yufenglee
yufenglee commented on 2024-05-14
edgchen1 Add !ORT_NEURAL_SPEED ifdefs around adding MatMulNBitsFusion transfor…
badcd0fa
edgchen1 add ifdef around test
b90c8cab
yufenglee
yufenglee dismissed these changes on 2024-05-15
edgchen1 Merge branch 'main' into edgchen1/matmul_nbits_bias
92ce6ebb
edgchen1 edgchen1 dismissed their stale review via 92ce6ebb 2 years ago
skottmckay
skottmckay approved these changes on 2024-05-16
edgchen1 edgchen1 merged e81c8676 into main 2 years ago
edgchen1 edgchen1 deleted the edgchen1/matmul_nbits_bias branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone