vllm
[Perf] Create TMA-aligned input scale tensor for DeepGemm on Hopper
#32619
Merged

[Perf] Create TMA-aligned input scale tensor for DeepGemm on Hopper #32619

mgoin merged 2 commits into vllm-project:main from xyang16:tma_aligned
xyang16
xyang16 xyang16 requested a review from mgoin mgoin 36 days ago
xyang16 xyang16 requested a review from tlrmchlsmth tlrmchlsmth 36 days ago
xyang16 xyang16 requested a review from WoosukKwon WoosukKwon 36 days ago
xyang16 xyang16 requested a review from yewentao256 yewentao256 36 days ago
xyang16 xyang16 requested a review from robertgshaw2-redhat robertgshaw2-redhat 36 days ago
xyang16 xyang16 requested a review from pavanimajety pavanimajety 36 days ago
mergify mergify added performance
xyang16 xyang16 changed the title [Kernel] Create TMA-aligned input scale tensor for DeepGemm on Hopper [Perf] Create TMA-aligned input scale tensor for DeepGemm on Hopper 36 days ago
gemini-code-assist
gemini-code-assist commented on 2026-01-19
heheda12345
mgoin
mgoin approved these changes on 2026-01-22
mgoin mgoin added ready
mgoin mgoin added deepseek
mgoin mgoin assigned mgoin mgoin 34 days ago
xyang16 [Kernel] Create TMA-aligned input scale tensor for DeepGemm on Hopper
5a71a0fd
xyang16 xyang16 force pushed to 5a71a0fd 34 days ago
xyang16 Review changes
4c7d0530
xyang16
xyang16 commented on 2026-01-22
mgoin mgoin changed the title [Perf] Create TMA-aligned input scale tensor for DeepGemm on Hopper [Perf] Create TMA-aligned input scale tensor for DeepGemm SM90/SM100 33 days ago
mgoin mgoin changed the title [Perf] Create TMA-aligned input scale tensor for DeepGemm SM90/SM100 [Perf] Create TMA-aligned input scale tensor for DeepGemm on Hopper 33 days ago
mgoin mgoin merged d08b356e into main 33 days ago
xyang16 xyang16 deleted the tma_aligned branch 33 days ago

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone