onnxruntime
Enable 2bit CPU matmul fallback
#25582
Merged

Enable 2bit CPU matmul fallback #25582

carzh merged 11 commits into main from carzh/enable-2bit-fallback
carzh
carzh enable fallback + switch to nbits quantizer
90f7c26e
carzh fix for quantizer script
e970f745
carzh figured out bug so uncommenting enforce line
ef145419
HectorSVC
HectorSVC commented on 2025-07-29
carzh 2bit quant script fix
69adef03
carzh added working matmul 2bits unit test
b591647b
github-actions
github-actions commented on 2025-07-30
github-advanced-security
github-advanced-security commented on 2025-07-30
carzh carzh marked this pull request as ready for review 201 days ago
carzh cleanup + lintrunner sigh
02c9ae39
carzh test wip for quantizer python
c38e8a45
github-advanced-security
github-advanced-security commented on 2025-07-30
github-advanced-security
github-advanced-security commented on 2025-07-30
hariharans29
hariharans29 commented on 2025-07-30
hariharans29
hariharans29 commented on 2025-07-30
hariharans29
hariharans29 commented on 2025-07-30
carzh fixed 4bit matmulnbitsquantizer.py test, applied suggestions (fixed c…
c4cf9b14
HectorSVC
HectorSVC commented on 2025-08-01
HectorSVC
HectorSVC commented on 2025-08-01
carzh updated test_op_matmul_2bits.py
f1d6470c
carzh removed gather tests as well
7246934c
github-actions
github-actions commented on 2025-08-01
github-advanced-security
github-advanced-security commented on 2025-08-01
HectorSVC format
6f10276c
HectorSVC
HectorSVC approved these changes on 2025-08-04
hariharans29
hariharans29 approved these changes on 2025-08-04
carzh carzh merged d5d3b287 into main 196 days ago
carzh carzh deleted the carzh/enable-2bit-fallback branch 196 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone