vllm
[Attention][Perf] Optimize cp_gather_and_upconvert_fp8_kv_cache - DeepSeek-v3.2
#35290
Merged

LopezCastroRoberto
LopezCastroRoberto init pr
33c96caf
LopezCastroRoberto marked this pull request as draft 74 days ago
gemini-code-assist
gemini-code-assist commented on 2026-02-25
LopezCastroRoberto changed the title from "[Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache" to "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache" 74 days ago
LopezCastroRoberto vectorized ops
b5e7b8f9
LopezCastroRoberto changed the title from "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache" to "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache - DeepSeek-v3.2" 73 days ago
mergify added the deepseek label
LopezCastroRoberto Adding tests and benchmark
6a17f099
mergify added the performance label
LopezCastroRoberto Adding more tests
4ccd4b35
LopezCastroRoberto marked this pull request as ready for review 72 days ago
LopezCastroRoberto requested a review from mgoin 72 days ago
LopezCastroRoberto requested a review from tlrmchlsmth 72 days ago
LopezCastroRoberto requested a review from WoosukKwon 72 days ago
LopezCastroRoberto requested a review from yewentao256 72 days ago
LucasWilkinson
LucasWilkinson approved these changes on 2026-03-04
LucasWilkinson enabled auto-merge (squash) 67 days ago
github-actions added the ready label
LopezCastroRoberto changed the title from "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache - DeepSeek-v3.2" to "[Attention][Perf] Optimize cp_gather_and_upconvert_fp8_kv_cache - DeepSeek-v3.2" 67 days ago
LopezCastroRoberto Merge branch 'main' into perf/gather_upconvert
e0635332
LopezCastroRoberto Merge branch 'main' into perf/gather_upconvert
53bd6e6a
LopezCastroRoberto Merge branch 'main' into perf/gather_upconvert
aa06e41c
LopezCastroRoberto Merge branch 'main' into perf/gather_upconvert
5e1d1263
LopezCastroRoberto Merge branch 'main' into perf/gather_upconvert
d9c6aeda
LopezCastroRoberto fix amd path
fd8e5161
disabled auto-merge 64 days ago
Head branch was pushed to by a user without write access
LucasWilkinson enabled auto-merge (squash) 64 days ago
LopezCastroRoberto Merge branch 'main' into perf/gather_upconvert
f1ac4559
mgoin
mgoin approved these changes on 2026-03-09
mgoin added the nvidia label
vllm-bot merged 2b28b9b2 into main 61 days ago
