onnxruntime
[CPU EP] Add blocked quantization to QuantizeLinear op kernel
#20977
Merged

[CPU EP] Add blocked quantization to QuantizeLinear op kernel #20977

fajin-corp merged 10 commits into main from fajin/qblockedquantize
fajin-corp
yufenglee
yufenglee commented on 2024-06-11
yufenglee
yufenglee commented on 2024-06-11
yufenglee
yufenglee commented on 2024-06-11
yufenglee
yufenglee commented on 2024-06-11
yufenglee
yufenglee commented on 2024-06-11
fajin-corp added baseline Q op kernel
8e2b6a62
fajin-corp finalized thread blocking approach
c3fa1cb4
fajin-corp passed build
abe14263
fajin-corp basedline passed UT
da9763aa
fajin-corp expanded UT to cover multithreading cases
4f7edfff
fajin-corp finished perf
7ac407d9
fajin-corp reset task block size
0b0075b6
fajin-corp resolve CI build failures
227386c2
fajin-corp change namings
c8afdbc8
fajin-corp fix ci warning
81d32fcf
fajin-corp fajin-corp force pushed from 2034cbd8 to 81d32fcf 1 year ago
yufenglee
yufenglee approved these changes on 2024-06-12
fajin-corp fajin-corp merged 9be30348 into main 1 year ago
fajin-corp fajin-corp deleted the fajin/qblockedquantize branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone