onnxruntime
59871e3b - Update qMoE spec to support block quantization (#25641)

Commit
171 days ago
Update qMoE spec to support block quantization (#25641)

Update operator spec to support block quantization in qMoE. Implementation will come later.