onnxruntime
Update MoE and qMoE spec
#25619
Merged

Update MoE and qMoE spec #25619

tianleiwu merged 16 commits into main from tlwu/moe_spec
tianleiwu
tianleiwu update moe spec
e7cb8448
tianleiwu tianleiwu marked this pull request as draft 207 days ago
github-actions
github-actions commented on 2025-08-01
github-advanced-security
github-advanced-security commented on 2025-08-01
tianleiwu update doc
bd36de44
github-actions
github-actions commented on 2025-08-01
tianleiwu format
1d70f69d
tianleiwu add swiglu limit
451814f9
tianleiwu Merge branch 'main' into tlwu/moe_spec
fa0224cd
tianleiwu CPU change from apsonawane
4a0d84f4
tianleiwu use moe_helper in CPU
03a61467
tianleiwu remove MoEQuantType
6a5871e8
tianleiwu Fix build
c4eb332a
tianleiwu Add swiglu parameters
08c31146
tianleiwu Merge branch 'main' into tlwu/moe_spec
b7de4a7c
tianleiwu update doc
edd065cb
tianleiwu improve backward compatible
1b72088f
tianleiwu tianleiwu marked this pull request as ready for review 206 days ago
tianleiwu Revert "emsdk" change
b1562ddc
tianleiwu refacotring
37abf5d5
tianleiwu Disable cpu qmoe test
0635f113
kunal-vaishnavi
kunal-vaishnavi commented on 2025-08-02
kunal-vaishnavi
kunal-vaishnavi commented on 2025-08-02
kunal-vaishnavi
kunal-vaishnavi approved these changes on 2025-08-02
tianleiwu tianleiwu merged 562760a5 into main 206 days ago
tianleiwu tianleiwu deleted the tlwu/moe_spec branch 206 days ago
jywu-msft jywu-msft added release:1.23.0
tianleiwu tianleiwu added cherry-picked
tianleiwu tianleiwu removed release:1.23.0

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone