vllm
[ROCm] Support MLA with nhead<16 and FP8 KV cache for TP=8 (Kimi K2.5/Linear)
#35850
Merged

ChuanLi1101
ChuanLi1101 requested a review from tjtanaa 9 days ago
mergify added rocm
mergify added v1
gemini-code-assist
gemini-code-assist commented on 2026-03-03
wuhuikx
zejunchen-zejun
zejunchen-zejun approved these changes on 2026-03-03
tjtanaa
tjtanaa approved these changes on 2026-03-05
tjtanaa added ready
tjtanaa enabled auto-merge (squash) 7 days ago
mergify
mergify added needs-rebase
ChuanLi1101 [ROCm] Support MLA with nhead<16 and FP8 KV cache for TP=8 (Kimi K2.5…
71cb317a
disabled auto-merge 6 days ago
Head branch was pushed to by a user without write access
ChuanLi1101 force pushed from db465b38 to 71cb317a 6 days ago
mergify removed needs-rebase
tjtanaa Merge branch 'main' into fix/mla-nhead-fp8-kv-upstream
04adb9e7
tjtanaa enabled auto-merge (squash) 6 days ago
tjtanaa merged c188749b into main 6 days ago
