QDQ transformations in the OpenVINO EP for the NPU device #20622
sspintel
marked this pull request as ready for review 1 year ago
sspintel
changed the title [Draft] QDQ stripping transformation in OpenVINO EP QDQ stripping transformation in OpenVINO EP 1 year ago
Draft code to remove Q/DQ ops from node units in OpenVINO EP
bfd35c96
remove unnecessary code
787e5128
Rename function, lintrunner
8590e6c9
Add rulesets for Q and DQ removal
47d48f65
Handle cases for unsupported QDQ targets
0e71fb4f
Detect and skip duplicated DQs to dst graph
96cccfb3
Add QDQ stripping to separate files
e059ff30
Fix resource access bug in duplicate DQ removal
49a2b603
Add extended rule sets for each Q and DQ in a NodeUnit
0cd32c47
Remove unreachable code + NPU can take FLOAT for unsupported initiali…
2788b200
Implement a better way to dump stripped models from OVEP
6b160078
Fix rulesets
257b0410
Add OV session option for PTQ model
f3c3bbe0
Enable qdq stripping only for PTQ models
9d78b6c4
Enable is_ptq for python APIs
f378f8ef
Fix to ignore unused initializers from dst graph
e3060ac2
Revert the logic and always keep initializers for nodes that are adde…
b46adeef
Rename flag to enable qdq optimizer; Fix bug in dst graph inputs orde…
4970fffd
Make enable_qdq_optimizer change in contexts.h
cc3dd38e
Enable Q ruleset for standalone Qs & Handle standalone duplicate DQs
09ba1291
Add check for QDQ model; Address PR review comments
e5344c2c
Dump graph name is unknown when input model is serialized
19a6af4c
Fix case of a StandAlone DQ feeding to a supported Op
e833cfe8
Verbose logging of qdq optimizer status and duration
351f74b3
Fix logging of qdq optimizer status
e156246e
Add standalone duplicate DQ DT check
4b9974b3
Fix for Linux build
c8c55cb3
Fix case when Qs have const init inputs
7b4acfa7
FIx review comments
96fc477b
Fix for Pad op with no dimensions
1e920b21
Formatting fix
22ae1a75
Coverty Issues Fixed
980e0bd8
fix coverity issues
d62aaf25
Rewrite Q ruleset for Conv and MatMul
bf99ed28
Fix for node return type in debug mode
ed611659
Exception for dynamic shape models with qdq stripping
2e9bb817
Revert "Rewrite Q ruleset for Conv and MatMul"
2575b586
Fix lint issues
4d3f82ab
sspintel
force pushed
from
7efebd70
to
4d3f82ab
1 year ago
sspintel
changed the title QDQ stripping transformation in OpenVINO EP QDQ transformations in the OpenVINO EP for the NPU device 1 year ago
Fix cpplint issues
f76fca45
jywu-msft
approved these changes
on 2024-05-24
jywu-msft
merged
1765da17
into main 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub