onnxruntime
f30581ed - [CPU EP] Add block quantized Gather contrib op (#21630)

Commit
1 year ago
[CPU EP] Add block quantized Gather contrib op (#21630) ### Description Add a gather that supports block-quantized input data. ### Motivation and Context To support Web inference scenario with quantized vocabulary embeddings.
Author
Parents
Loading