vllm
[ROCm][Perf] Fused MoE W4A16 HIP kernel for AMD RDNA3 (gfx1100)
#44075
Open

[ROCm][Perf] Fused MoE W4A16 HIP kernel for AMD RDNA3 (gfx1100) #44075

JartX wants to merge 11 commits into vllm-project:main from JartX:perf/moe_rdna3_w4a16
JartX
JartX JartX requested a review from tjtanaa tjtanaa 4 days ago
JartX JartX requested a review from dllehr-amd dllehr-amd 4 days ago
JartX JartX requested a review from tlrmchlsmth tlrmchlsmth 4 days ago
JartX JartX requested a review from LucasWilkinson LucasWilkinson 4 days ago
JartX JartX requested a review from Harry-Chen Harry-Chen 4 days ago
JartX JartX requested a review from mgoin mgoin 4 days ago
JartX JartX requested a review from robertgshaw2-redhat robertgshaw2-redhat 4 days ago
JartX JartX requested a review from yewentao256 yewentao256 4 days ago
JartX JartX requested a review from pavanimajety pavanimajety 4 days ago
JartX JartX requested a review from zyongye zyongye 4 days ago
mergify mergify added ci/build
mergify mergify added rocm
mergify
JartX JartX force pushed from d0612574 to e283487d 4 days ago
AndreasKaratzas
JartX JartX force pushed from e283487d to 242c24d7 4 days ago
JartX JartX force pushed from 242c24d7 to ff3d9e93 4 days ago
JartX JartX force pushed from ff3d9e93 to 29d6afd6 4 days ago
JartX JartX requested a review from WoosukKwon WoosukKwon 4 days ago
JartX JartX requested a review from AndreasKaratzas AndreasKaratzas 4 days ago
JartX JartX force pushed from 29d6afd6 to dd0a25cd 4 days ago
depthfirst-app
depthfirst-app commented on 2026-05-30
JartX JartX force pushed from dd0a25cd to a948f5de 4 days ago
depthfirst-app
depthfirst-app commented on 2026-05-30
DarkLight1337 DarkLight1337 added verified
tjtanaa tjtanaa added ready
tjtanaa
tjtanaa
tjtanaa
tjtanaa commented on 2026-05-31
tjtanaa
tjtanaa commented on 2026-05-31
JartX JartX force pushed from a948f5de to a3a9c7dd 3 days ago
JartX JartX force pushed from a3a9c7dd to 25f4e08f 3 days ago
JartX JartX force pushed from 25f4e08f to eca6802e 3 days ago
tjtanaa
mergify
depthfirst-app
depthfirst-app commented on 2026-05-31
JartX [ROCm][Perf] Fused MoE W4A16 HIP kernel for AMD RDNA3 (gfx1100)
32c485d1
JartX JartX force pushed from eca6802e to 32c485d1 3 days ago
JartX
JartX
JartX Merge branch 'main' into perf/moe_rdna3_w4a16
2e9b8799
JartX Add M=256,512 to MoE W4A16 test token counts
2a179486
JartX Merge branch 'main' into perf/moe_rdna3_w4a16
75b7006a
JartX Merge remote-tracking branch 'origin/main' into perf/moe_rdna3_w4a16
17375ed2
JartX [Test] Add RDNA3 W4A16 compile-guard tests for gfx1100 hermeticity
eec8f411
JartX
JartX Merge branch 'main' into perf/moe_rdna3_w4a16
3c34046d
JartX [Test] Fix lint in RDNA3 compile-guard tests
c9df0dd9
depthfirst-app
depthfirst-app commented on 2026-06-01
JartX [Test] Skip RDNA3 source-guard checks when file absent from tree
2f579257
JartX [Test] Read RDNA3 python guard sources from installed vllm package
2ead8d46
BowenBao
BowenBao commented on 2026-06-02
JartX Merge branch 'main' into perf/moe_rdna3_w4a16
9939b7e2
BowenBao
BowenBao approved these changes on 2026-06-03
JartX

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone