onnxruntime
Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel
#24959

Merged

Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel #24959

hariharans29 merged 9 commits into main from hari/matmulnbits_8_bit_fallback

Support 8 bit weights unpacked mode compute in MatmulNBits kernel

9fbda7f4

Modify error message

e9b8e7fb

hariharans29 changed the title ~~Support 8 bit weights unpacked mode compute in MatmulNBits kernel~~ Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel 294 days ago

hariharans29 requested a review from

jywu-msft 294 days ago

hariharans29 requested a review from

edgchen1 294 days ago

github-actions commented on 2025-06-05

Update onnxruntime/contrib_ops/cpu/quantization/matmul_nbits.cc

e3bf04f7

Merge branch 'main' of https://github.com/microsoft/onnxruntime into …

39bbbbd5

Fix lint issues

8b2c6820

Merge branch 'hari/matmulnbits_8_bit_fallback' of https://github.com/…

64f7ac5a

edgchen1 commented on 2025-06-06

PR feedback

954069e4

Nit

0723529f

edgchen1 commented on 2025-06-11

edgchen1 dismissed these changes on 2025-06-11

Update onnxruntime/contrib_ops/cpu/quantization/matmul_nbits.cc

6966ff22

hariharans29 dismissed their stale review via 6966ff22 288 days ago

edgchen1 approved these changes on 2025-06-12

hariharans29 merged 3b855e1d into main 286 days ago

hariharans29 deleted the hari/matmulnbits_8_bit_fallback branch 286 days ago

Reviewers

edgchen1

github-actions

jywu-msft

Assignees

No one assigned

Labels

None yet

Milestone

No milestone

onnxruntime Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel #24959 Merged

Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel #24959

onnxruntime
Support 8 bit weights "unpacked" compute mode in MatmulNBits kernel
#24959

Merged