onnxruntime
QDQ transformations in the OpenVINO EP for the NPU device
#20622
Merged

sspintel
jywu-msft requested a review from skottmckay 1 year ago
jywu-msft requested a review from adrianlizarraga 1 year ago
jywu-msft requested a review from jywu-msft 1 year ago
jywu-msft requested a review from HectorSVC 1 year ago
github-advanced-security commented on 2024-05-09
skottmckay commented on 2024-05-10
sspintel
skottmckay
sspintel
adrianlizarraga commented on 2024-05-13
adrianlizarraga commented on 2024-05-13
adrianlizarraga commented on 2024-05-13
adrianlizarraga commented on 2024-05-13
sspintel marked this pull request as ready for review 1 year ago
sspintel changed the title from "[Draft] QDQ stripping transformation in OpenVINO EP" to "QDQ stripping transformation in OpenVINO EP" 1 year ago
adrianlizarraga Draft code to remove Q/DQ ops from node units in OpenVINO EP
bfd35c96
adrianlizarraga remove unnecessary code
787e5128
adrianlizarraga Rename function, lintrunner
8590e6c9
sspintel Add rulesets for Q and DQ removal
47d48f65
sspintel Handle cases for unsupported QDQ targets
0e71fb4f
sspintel Detect and skip duplicated DQs to dst graph
96cccfb3
sspintel Add QDQ stripping to separate files
e059ff30
sspintel Fix resource access bug in duplicate DQ removal
49a2b603
sspintel Add extended rule sets for each Q and DQ in a NodeUnit
0cd32c47
sspintel Remove unreachable code + NPU can take FLOAT for unsupported initiali…
2788b200
sspintel Implement a better way to dump stripped models from OVEP
6b160078
sspintel Fix rulesets
257b0410
preetha-intel Add OV session option for PTQ model
f3c3bbe0
preetha-intel Enable qdq stripping only for PTQ models
9d78b6c4
sspintel Enable is_ptq for python APIs
f378f8ef
sspintel Fix to ignore unused initializers from dst graph
e3060ac2
sspintel Revert the logic and always keep initializers for nodes that are adde…
b46adeef
sspintel Rename flag to enable qdq optimizer; Fix bug in dst graph inputs orde…
4970fffd
sspintel Make enable_qdq_optimizer change in contexts.h
cc3dd38e
sspintel Enable Q ruleset for standalone Qs & Handle standalone duplicate DQs
09ba1291
sspintel Add check for QDQ model; Address PR review comments
e5344c2c
sspintel Dump graph name is unknown when input model is serialized
19a6af4c
sspintel Fix case of a StandAlone DQ feeding to a supported Op
e833cfe8
sspintel Verbose logging of qdq optimizer status and duration
351f74b3
sspintel Fix logging of qdq optimizer status
e156246e
sspintel Add standalone duplicate DQ DT check
4b9974b3
sspintel Fix for Linux build
c8c55cb3
sspintel Fix case when Qs have const init inputs
7b4acfa7
sspintel Fix review comments
96fc477b
sspintel Fix for Pad op with no dimensions
1e920b21
sspintel Formatting fix
22ae1a75
sfatimar Coverity Issues Fixed
980e0bd8
saurabhkale17 fix coverity issues
d62aaf25
sspintel Rewrite Q ruleset for Conv and MatMul
bf99ed28
sspintel Fix for node return type in debug mode
ed611659
sspintel Exception for dynamic shape models with qdq stripping
2e9bb817
sspintel Revert "Rewrite Q ruleset for Conv and MatMul"
2575b586
sspintel Fix lint issues
4d3f82ab
sspintel force-pushed from 7efebd70 to 4d3f82ab 1 year ago
sspintel changed the title from "QDQ stripping transformation in OpenVINO EP" to "QDQ transformations in the OpenVINO EP for the NPU device" 1 year ago
sfatimar
jywu-msft
azure-pipelines
jywu-msft
azure-pipelines
sspintel Fix cpplint issues
f76fca45
jywu-msft
azure-pipelines
jywu-msft
azure-pipelines
jywu-msft
azure-pipelines
jywu-msft approved these changes on 2024-05-24
jywu-msft merged 1765da17 into main 1 year ago
