vllm
[Model Bash] DeepSeek R1 BF16 Min Latency QKV A GEMM (0.5% E2E Speedup)
#34758
Merged

[Model Bash] DeepSeek R1 BF16 Min Latency QKV A GEMM (0.5% E2E Speedup) #34758

vllm-bot merged 12 commits into main from add-sgl-a-gemm
robertgshaw2-redhat
initial commit
9557cfc0
robertgshaw2-redhat robertgshaw2-redhat requested a review from tlrmchlsmth tlrmchlsmth 82 days ago
robertgshaw2-redhat robertgshaw2-redhat requested a review from LucasWilkinson LucasWilkinson 82 days ago
mergify mergify added ci/build
robertgshaw2-redhat robertgshaw2-redhat changed the title initial commit [Model Bash] DeepSeek R1 KV A GEMM 82 days ago
mergify mergify added deepseek
gemini-code-assist
gemini-code-assist commented on 2026-02-17
update to make the changes deepseek specific
c3d3de5c
update build
9788498b
mgoin
mgoin commented on 2026-02-18
Swich which layer
3b525e47
robertgshaw2-redhat robertgshaw2-redhat changed the title [Model Bash] DeepSeek R1 KV A GEMM [Model Bash] DeepSeek R1 BF16 KV A GEMM 82 days ago
update cmaklists
29127d49
fix build
6b7048bf
fix missing symbol
5b249984
thanks claude!
72fc0635
is new sonnet the best model?
41f9f7d1
remove duplicate
c497839b
remove debug cruft
61edbde0
robertgshaw2-redhat robertgshaw2-redhat changed the title [Model Bash] DeepSeek R1 BF16 KV A GEMM [Model Bash] DeepSeek R1 BF16 Min Latency KV A GEMM (0.5% E2E Speedup) 82 days ago
mgoin
mgoin approved these changes on 2026-02-18
address mgoin comments
c354d751
robertgshaw2-redhat robertgshaw2-redhat enabled auto-merge (squash) 82 days ago
github-actions github-actions added ready
mgoin mgoin changed the title [Model Bash] DeepSeek R1 BF16 Min Latency KV A GEMM (0.5% E2E Speedup) [Model Bash] DeepSeek R1 BF16 Min Latency QKV A GEMM (0.5% E2E Speedup) 82 days ago
vllm-bot vllm-bot merged 6874638b into main 82 days ago
vllm-bot vllm-bot deleted the add-sgl-a-gemm branch 82 days ago
eugr
SurealCereal
mgoin

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone