onnxruntime
59871e3b
- Update qMoE spec to support block quantization (#25641)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
171 days ago
Update qMoE spec to support block quantization (#25641) Update operator spec to support block quantization in qMoE. Implementation will come later.
References
#25641 - Update qMoE spec to support block quantization
Author
tianleiwu
Parents
14ca6df1
Loading