openvino
[GPU] MoE 3 GeMM: separate Router Subgraph and MoE body kernels
#35691
Open
v-Golubev wants to merge 14 commits into openvinotoolkit:master from v-Golubev:vg/transformations/moe_extract_router_to_separate_node
Tests extending (f00f4c5c)
style (7411b115)
[Transformations] ConvertTiledMoeBlockTo3GatherMatmuls: Gelu activati… (172ba935)
Add MoERouterFused op/primitive infrastructure (no behavior change) (d060d71a)
Add standalone MoERouterFused accuracy tests (8 tests pass) (f5c773df)
Split routing logic from MOE3Gemm into MoERouterFused (2f136d71)
Update accuracy tests for two-primitive router+MOE flow (9f7c122d)
Remove dead routing kernel classes and fix functional test assertion (584f907c)
Remove wasted topk internal buffer allocations from MOE3Gemm (3bce0a91)
Update primitive header comments to reflect new input layout (881a54a7)
Unify MOE3GemmFusedCompressed into MOECompressed op (5a328a82)
Replace FuseMOE3GemmCompressed with FuseMoERouter (325a937c)
cleanup (8be5d029)
cleanup (5a2b056a)
v-Golubev requested a review 1 day ago
v-Golubev added WIP
github-actions added category: GPU
github-actions added category: transformations
Reviewers: No reviews
Assignees: No one assigned
Labels: category: GPU, category: transformations, WIP
Milestone: No milestone