onnxruntime
[Quantization] Tensor quant overrides and QNN EP quantization configuration
#18465
Merged

adrianlizarraga merged 44 commits into main from adrianl/qnn-ep-quant-config
adrianlizarraga First pass at supporting tensor quantization overrides
e434a68e
adrianlizarraga Run lintrunner
83139ec0
adrianlizarraga Start creating QNN specific quantization configs
3f6dd878
adrianlizarraga Try to override for MatMul, Sigmoid, Tanh
ba3c6823
adrianlizarraga Fix import issue and assert
d499d97d
adrianlizarraga Fix quant type data type
046d8f44
adrianlizarraga Add option for qdq contrib ops
98e6b757
github-advanced-security commented on 2023-11-16
HectorSVC commented on 2023-11-16
adrianlizarraga Only override weights for LayerNorm and MatMul
f59a8a8b
adrianlizarraga Separate handling of MatMul and LayerNorm
cca480e2
adrianlizarraga Set DedicatedQDQPair to false
1f1e5e03
adrianlizarraga Provide a value for symmetric when overriding weight types
1182893c
adrianlizarraga Automatically set op_types_to_quantize
40d304cb
adrianlizarraga Treat bias tensors as weights if the user provides quantization overr…
6a5af05e
HectorSVC commented on 2023-11-16
adrianlizarraga Use quant_type variable properly
06ca8d61
adrianlizarraga Add TODO to remove UseQDQContribOps extra_option
c7cb8993
adrianlizarraga Update tensor overrides doc comment
b0516519
adrianlizarraga commented on 2023-11-17
adrianlizarraga Ok to quantize Transpose
724f0df9
jywu-msft requested a review from yufenglee 2 years ago
adrianlizarraga Merge latest from main
c35d7cf2
adrianlizarraga Treat initializers for Add, Sub, Mul, Div as weights
aa673b6f
adrianlizarraga Add qnn quantization directories to python package
476f10a8
adrianlizarraga Do not treat initializer inputs to Add/Mul as weights
c4523527
adrianlizarraga Pass in final value of reduce_range to quantize_initializer() function
c0b95f20
adrianlizarraga Get quant overrides from within quantize_initializer()
84bf83a7
github-advanced-security commented on 2023-11-21
adrianlizarraga Use quant overrides for per-channel weights
198c21f1
github-advanced-security commented on 2023-11-21
adrianlizarraga Change qType to quant_type to appease the python linter
97fd781f
adrianlizarraga Add some validation for tensor quant overrides
f9015bff
adrianlizarraga Use existing helper to check if input is an initializer
d927c658
adrianlizarraga marked this pull request as ready for review 2 years ago
adrianlizarraga Add unit tests for tensor quant overrides (QDQ)
e5db60ce
adrianlizarraga Fix linter issues
b910f652
github-advanced-security commented on 2023-11-21
adrianlizarraga Use variable consistently
418d2b9c
adrianlizarraga Add test for bias overrides
1da0b787
adrianlizarraga Add more validation tests
49b477a8
github-advanced-security commented on 2023-11-21
adrianlizarraga Address CodeQL notices
8bb660f3
adrianlizarraga changed the title from "[QNN EP] Experiment with QNN EP quantization configurations" to "[QNN EP] QNN EP quantization configurations" 2 years ago
adrianlizarraga Merge latest from main
d270e451
adrianlizarraga Fix 16-bit computation of scale/zero_point for QDQ Softmax
e973fae6
adrianlizarraga Fix merge conflicts
f4c85549
adrianlizarraga Merge branch 'main' into adrianl/qnn-ep-quant-config
193490d0
adrianlizarraga Allow overriding initializer rmin, rmax, scale, or zero_point
8d2a1f88
adrianlizarraga Updated to handle per-channel quantization overrides
fca3ef6d
adrianlizarraga Clean up
8cbed6a8
adrianlizarraga Rename variable
a2627457
adrianlizarraga Add another per-channel quantization override test
bd3974c7
adrianlizarraga changed the title from "[QNN EP] QNN EP quantization configurations" to "[Quantization] Tensor quant overrides and QNN EP quantization configuration" 2 years ago
adrianlizarraga Merge branch 'main' into adrianl/qnn-ep-quant-config
4be070e9
yufenglee approved these changes on 2023-12-04
adrianlizarraga Merge latest commits from main branch
3be414b6
adrianlizarraga merged e066fca7 into main 2 years ago
adrianlizarraga deleted the adrianl/qnn-ep-quant-config branch 2 years ago
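
Several commits above deal with overriding rmin, rmax, scale, or zero_point for a tensor, including at 16-bit width. As a rough illustration of how those four values relate, here is a minimal sketch of standard asymmetric affine quantization. It is not the code from this PR: the function name `affine_qparams` and its parameters are hypothetical, and the rounding and clamping choices are assumptions made for the example.

```python
def affine_qparams(rmin, rmax, qmin, qmax):
    """Return (scale, zero_point) mapping the real range [rmin, rmax]
    onto the integer range [qmin, qmax].

    Hypothetical helper for illustration only; not an onnxruntime API.
    """
    # Widen the range to include zero so that 0.0 maps to an exact integer.
    rmin = min(rmin, 0.0)
    rmax = max(rmax, 0.0)
    if rmax == rmin:
        # Degenerate all-zero range: fall back to a unit scale (assumption).
        return 1.0, qmin
    scale = (rmax - rmin) / (qmax - qmin)
    zero_point = round(qmin - rmin / scale)
    # Keep the zero point inside the representable integer range.
    zero_point = max(qmin, min(qmax, zero_point))
    return scale, zero_point


# The same real range quantized at two bit widths.
scale_u8, zp_u8 = affine_qparams(0.0, 1.0, 0, 255)
scale_u16, zp_u16 = affine_qparams(0.0, 1.0, 0, 65535)

# A range that straddles zero gets a non-zero zero point.
scale_mixed, zp_mixed = affine_qparams(-1.0, 3.0, 0, 255)
```

Under this formulation, supplying rmin and rmax determines scale and zero_point, so an override would typically give one pair or the other rather than both.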
