onnxruntime
f30581ed
- [CPU EP] Add block quantized Gather contrib op (#21630)
Commit
1 year ago
[CPU EP] Add block quantized Gather contrib op (#21630)

### Description
Add a Gather op that supports block-quantized input data.

### Motivation and Context
To support the Web inference scenario with quantized vocabulary embeddings.
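
The general idea of gathering from block-quantized data can be illustrated with a small pure-Python sketch. This is not the contrib op's implementation or signature: the function name `gather_block_quantized`, its parameters, the affine formula `(q - zero_point) * scale`, and the choice to block along the last axis are all assumptions made for illustration. Only the selected rows are dequantized, so the full embedding table can stay in its quantized form.

```python
from typing import List, Sequence


def gather_block_quantized(
    quantized: Sequence[Sequence[int]],
    scales: Sequence[Sequence[float]],
    zero_points: Sequence[Sequence[int]],
    indices: Sequence[int],
    block_size: int,
) -> List[List[float]]:
    """Hypothetical helper: gather rows by index, then dequantize them.

    Assumes each row is split into blocks of `block_size` consecutive
    columns, and each block has its own scale and zero point.
    """
    out: List[List[float]] = []
    for idx in indices:
        row = quantized[idx]
        dequantized: List[float] = []
        for col, q in enumerate(row):
            block = col // block_size  # which block this column belongs to
            # Assumed affine dequantization: (q - zero_point) * scale
            dequantized.append((q - zero_points[idx][block]) * scales[idx][block])
        out.append(dequantized)
    return out


# Toy "vocabulary embedding" table: 3 rows, 4 columns, 2 columns per block.
quantized = [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11]]
scales = [[0.5, 1.0], [2.0, 0.25], [1.0, 1.0]]
zero_points = [[0, 2], [4, 6], [8, 8]]

# Look up rows 2 and 0, as a token-embedding lookup would.
rows = gather_block_quantized(quantized, scales, zero_points, [2, 0], block_size=2)
print(rows)
```

In this toy example only two of the three rows are ever converted to floating point, which is the property that makes this pattern attractive for large embedding tables.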
References
#21630 - [CPU EP] Add block quantized Gather contrib op
Author
fajin-corp
Parents
702b2e28