onnxruntime
[CPU] Add 8bit support to matmulnbits quantizer
#24384
Merged

[CPU] Add 8bit support to matmulnbits quantizer #24384

jiafatom merged 14 commits into main from fajin/matmulnbit8bit_quantizer
fajin-corp
fajin-corp fajin-corp requested a review 256 days ago
fajin-corp fajin-corp assigned fajin-corp fajin-corp 256 days ago
fajin-corp fajin-corp assigned liqunfu liqunfu 256 days ago
liqunfu
liqunfu
liqunfu commented on 2025-04-10
liqunfu
liqunfu commented on 2025-04-10
liqunfu
liqunfu commented on 2025-04-10
jiafatom
jiafatom commented on 2025-04-10
jiafatom
liqunfu
liqunfu
liqunfu dismissed these changes on 2025-04-11
fajin-corp init
16a3a8e2
fajin-corp finished quantizeAndTranspose for 8b and 2b
d61cfcd7
fajin-corp finished dequantize for 2b, 4b, 8b
04000081
fajin-corp changed interface
819a3c80
fajin-corp fixed q4 ut
f9fff2a7
fajin-corp fixed 4bit ut
9d9bb0ef
fajin-corp debugging q8
ad90bac4
fajin-corp fixed q8 ut
26b27632
fajin-corp finished ut
978dab16
fajin-corp updating quantizer
b90938c0
fajin-corp added q8 quantizer
a5a1784b
fajin-corp Add todo
8929e694
fajin-corp fix linting
700c1d55
fajin-corp fajin-corp dismissed their stale review via 700c1d55 256 days ago
fajin-corp fajin-corp force pushed from 1be4c8eb to 700c1d55 256 days ago
fajin-corp add todo
5125c527
jiafatom
jiafatom approved these changes on 2025-04-14
liqunfu
liqunfu approved these changes on 2025-04-14
jiafatom jiafatom merged 9a993c37 into main 253 days ago
jiafatom jiafatom deleted the fajin/matmulnbit8bit_quantizer branch 253 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
Labels
Milestone