onnxruntime
4b8f6dcb
- [QNN EP] Improve INT4 accuracy (#21582)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
[QNN EP] Improve INT4 accuracy (#21582) ### Description Masks off top 4-bits of INT4 weights, improving accuracy. ### Motivation and Context This is a workaround as the QNN docs state masking is not required.
References
#21582 - [QNN EP] Improve INT4 accuracy
Author
adrianlizarraga
Parents
8540ac4f
Loading