vllm
[ROCm][Perf] Fused MoE W4A16 HIP kernel for AMD RDNA3 (gfx1100)
#44075
Open
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
11
Changes
View On
GitHub
[ROCm][Perf] Fused MoE W4A16 HIP kernel for AMD RDNA3 (gfx1100)
#44075
JartX
wants to merge 11 commits into
vllm-project:main
from
JartX:perf/moe_rdna3_w4a16
JartX
requested a review
from
tjtanaa
4 days ago
JartX
requested a review
from
dllehr-amd
4 days ago
JartX
requested a review
from
tlrmchlsmth
4 days ago
JartX
requested a review
from
LucasWilkinson
4 days ago
JartX
requested a review
from
Harry-Chen
4 days ago
JartX
requested a review
from
mgoin
4 days ago
JartX
requested a review
from
robertgshaw2-redhat
4 days ago
JartX
requested a review
from
yewentao256
4 days ago
JartX
requested a review
from
pavanimajety
4 days ago
JartX
requested a review
from
zyongye
4 days ago
mergify
added
ci/build
mergify
added
rocm
JartX
force pushed
from
d0612574
to
e283487d
4 days ago
JartX
force pushed
from
e283487d
to
242c24d7
4 days ago
JartX
force pushed
from
242c24d7
to
ff3d9e93
4 days ago
JartX
force pushed
from
ff3d9e93
to
29d6afd6
4 days ago
JartX
requested a review
from
WoosukKwon
4 days ago
JartX
requested a review
from
AndreasKaratzas
4 days ago
JartX
force pushed
from
29d6afd6
to
dd0a25cd
4 days ago
depthfirst-app
commented on 2026-05-30
JartX
force pushed
from
dd0a25cd
to
a948f5de
4 days ago
depthfirst-app
commented on 2026-05-30
DarkLight1337
added
verified
tjtanaa
added
ready
tjtanaa
commented on 2026-05-31
tjtanaa
commented on 2026-05-31
JartX
force pushed
from
a948f5de
to
a3a9c7dd
3 days ago
JartX
force pushed
from
a3a9c7dd
to
25f4e08f
3 days ago
JartX
force pushed
from
25f4e08f
to
eca6802e
3 days ago
depthfirst-app
commented on 2026-05-31
[ROCm][Perf] Fused MoE W4A16 HIP kernel for AMD RDNA3 (gfx1100)
32c485d1
JartX
force pushed
from
eca6802e
to
32c485d1
3 days ago
Merge branch 'main' into perf/moe_rdna3_w4a16
2e9b8799
Add M=256,512 to MoE W4A16 test token counts
2a179486
Merge branch 'main' into perf/moe_rdna3_w4a16
75b7006a
Merge remote-tracking branch 'origin/main' into perf/moe_rdna3_w4a16
17375ed2
[Test] Add RDNA3 W4A16 compile-guard tests for gfx1100 hermeticity
eec8f411
Merge branch 'main' into perf/moe_rdna3_w4a16
3c34046d
[Test] Fix lint in RDNA3 compile-guard tests
c9df0dd9
depthfirst-app
commented on 2026-06-01
[Test] Skip RDNA3 source-guard checks when file absent from tree
2f579257
[Test] Read RDNA3 python guard sources from installed vllm package
2ead8d46
BowenBao
commented on 2026-06-02
Merge branch 'main' into perf/moe_rdna3_w4a16
9939b7e2
BowenBao
approved these changes on 2026-06-03
Login to write a write a comment.
Login via GitHub
Reviewers
BowenBao
depthfirst-app
tjtanaa
bnellnm
dllehr-amd
tlrmchlsmth
LucasWilkinson
Harry-Chen
mgoin
robertgshaw2-redhat
yewentao256
pavanimajety
zyongye
WoosukKwon
AndreasKaratzas
Assignees
No one assigned
Labels
rocm
ready
ci/build
verified
Milestone
No milestone
Login to write a write a comment.
Login via GitHub