onnxruntime
Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel
#24959
Merged

Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel #24959

hariharans29
hariharans29 Support 8 bit weights unpacked mode compute in MatmulNBits kernel
9fbda7f4
hariharans29 Modify error message
e9b8e7fb
hariharans29 hariharans29 changed the title Support 8 bit weights unpacked mode compute in MatmulNBits kernel Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel 294 days ago
hariharans29 hariharans29 requested a review from jywu-msft jywu-msft 294 days ago
hariharans29 hariharans29 requested a review from edgchen1 edgchen1 294 days ago
github-actions
github-actions commented on 2025-06-05
hariharans29 Update onnxruntime/contrib_ops/cpu/quantization/matmul_nbits.cc
e3bf04f7
hariharans29 Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
39bbbbd5
hariharans29 Fix lint issues
8b2c6820
hariharans29 Merge branch 'hari/matmulnbits_8_bit_fallback' of https://github.com/…
64f7ac5a
edgchen1
edgchen1 commented on 2025-06-06
hariharans29 PR feedback
954069e4
hariharans29 Nit
0723529f
edgchen1
edgchen1 commented on 2025-06-11
edgchen1
edgchen1 dismissed these changes on 2025-06-11
hariharans29 Update onnxruntime/contrib_ops/cpu/quantization/matmul_nbits.cc
6966ff22
hariharans29 hariharans29 dismissed their stale review via 6966ff22 288 days ago
edgchen1
edgchen1 approved these changes on 2025-06-12
hariharans29 hariharans29 merged 3b855e1d into main 286 days ago
hariharans29 hariharans29 deleted the hari/matmulnbits_8_bit_fallback branch 286 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone