vllm
[EP+DP] Optimize the little operations in the DeepGEMM + DeepEP low latency case
#19885
Merged

WoosukKwon merged 10 commits into main from ll_deepgemm_opt
tlrmchlsmth deep_ep + use_fp8_dispatch
8de2fd39
tlrmchlsmth Merge remote-tracking branch 'nm/varun/deepep-fp8-dispatch' into ll_d…
104a984e
tlrmchlsmth DeepGEMM LL optimizations
299f8291
fixes - use-fp8-dispatch
2b5ad9f2
github-actions
gemini-code-assist commented on 2025-06-20
mergify added qwen
gemini-code-assist commented on 2025-06-20
mgoin approved these changes on 2025-06-20
mgoin added deepseek
mgoin added performance
tlrmchlsmth Unit test
d5f20676
tlrmchlsmth fixes
26fd8ca3
tlrmchlsmth precommit
7a821f0e
tlrmchlsmth requested a review from WoosukKwon 339 days ago
tlrmchlsmth tweaks
39d5d33f
tlrmchlsmth fixup
21ffc735
tlrmchlsmth enabled auto-merge (squash) 339 days ago
github-actions added ready
tlrmchlsmth tolerances
b4f17e12
disabled auto-merge 336 days ago
Manually disabled by user
WoosukKwon merged 68aaeb37 into main 336 days ago
WoosukKwon deleted the ll_deepgemm_opt branch 336 days ago

