openvino
Extend moe_3gemm to all oneDNN aware GPUs
#35335
Open

peterchen-intel
peterchen-intel intel_gpu: fix Qwen3 MoE GEMM3_SWIGLU on MTL-class (non-systolic) iGPU
a7fa364a
peterchen-intel intel_gpu: scope MOE3GemmFusedCompressed queue fix to in_order only
2bcc00a2
peterchen-intel intel_gpu: revert skip_transfer_on_igpu extension to MTL
d442d393
peterchen-intel intel_gpu: revert moe_gather.hpp rank-2 fix (not needed in final path)
4a0a5044
peterchen-intel requested a review 50 days ago
peterchen-intel requested a review 50 days ago
github-actions added category: GPU
peterchen-intel requested a review from riverlijunjie 50 days ago
peterchen-intel assigned e-ddykim 50 days ago
peterchen-intel added under_perf_check
peterchen-intel intel_gpu: fix oneDNN engine init for MoE on non-systolic GPU
e1a7c831
peterchen-intel requested a review from copilot-pull-request-reviewer 49 days ago
copilot-pull-request-reviewer commented on 2026-04-15
rkazants commented on 2026-04-16
peterchen-intel Merge branch 'master' into oom/fixing
8a030fd1
peterchen-intel added do_not_merge
peterchen-intel
peterchen-intel intel_gpu: fix unnecessary tmp_out buffer per-layer in paged_attention
1498dd1f
peterchen-intel force pushed to 1498dd1f 43 days ago
peterchen-intel Merge branch 'master' into oom/fixing
20045578
peterchen-intel removed under_perf_check
peterchen-intel removed do_not_merge
peterchen-intel Merge branch 'master' into oom/fixing
c9eb5c68
peterchen-intel added under_perf_check
peterchen-intel
peterchen-intel changed the title from "Extend moe_3gemm to all Ultra series iGPU" to "Extend moe_3gemm to all GPUs and reduce internal buffer holding in paged_attention_opt" 42 days ago
ceciliapeng2011
riverlijunjie commented on 2026-04-24
peterchen-intel Merge branch 'master' into oom/fixing
046c575a
riverlijunjie approved these changes on 2026-05-11
peterchen-intel Merge branch 'master' into oom/fixing
3ebb016f
peterchen-intel Roll back the mistaken change
990c0bc7
peterchen-intel Merge branch 'master' into oom/fixing
39ba7fca
peterchen-intel commented on 2026-05-21
peterchen-intel intel_gpu: enable compressed MoE fusion chain on non-systolic devices
2ee0c089
peterchen-intel force pushed to 2ee0c089 12 days ago
peterchen-intel Merge branch 'master' into oom/fixing
12847654
peterchen-intel changed the title from "Extend moe_3gemm to all GPUs and reduce internal buffer holding in paged_attention_opt" to "Extend moe_3gemm to all Intel GPUs" 12 days ago
peterchen-intel requested a review from copilot-pull-request-reviewer 12 days ago
peterchen-intel requested a review from riverlijunjie 12 days ago
peterchen-intel requested a review from rkazants 12 days ago
copilot-pull-request-reviewer commented on 2026-05-22
peterchen-intel Support oneDNN known gpu_arch only
f8d0cc30
peterchen-intel requested a review from copilot-pull-request-reviewer 1 day ago
copilot-pull-request-reviewer commented on 2026-06-02
peterchen-intel device_info.arch >= cldnn::gpu_arch::xe_lp
3875448b
peterchen-intel changed the title from "Extend moe_3gemm to all Intel GPUs" to "Extend moe_3gemm to all oneDNN known GPUs" 1 day ago
peterchen-intel changed the title from "Extend moe_3gemm to all oneDNN known GPUs" to "Extend moe_3gemm to all oneDNN aware GPUs" 1 day ago
peterchen-intel requested a review from copilot-pull-request-reviewer 1 day ago
peterchen-intel Merge branch 'master' into oom/fixing
ae72a35e
copilot-pull-request-reviewer commented on 2026-06-02
peterchen-intel commented on 2026-06-02
peterchen-intel commented on 2026-06-02
peterchen-intel commented on 2026-06-02
peterchen-intel For ENABLE_ONEDNN_FOR_GPU only
fa26fd1f
peterchen-intel commented on 2026-06-02
peterchen-intel Remove duplicate
4ce2b9a5
peterchen-intel Remove duplicate
dfd234cc
peterchen-intel requested a review from copilot-pull-request-reviewer 1 day ago
copilot-pull-request-reviewer commented on 2026-06-02
peterchen-intel Define disable_moe_opt only if ENABLE_ONEDNN_FOR_GPU
8edab885
