vllm
[PD][HeteroArch]Fix accuracy issue with CPU_ATTN as Decoder and Flash_ATTN as prefiller
#38935
Merged

[PD][HeteroArch]Fix accuracy issue with CPU_ATTN as Decoder and Flash_ATTN as prefiller #38935

xuechendi
xuechendi add post_process path for CPU
0ca6a24d
xuechendi Enable pack_kv_cache for CPU
736a3d89
xuechendi xuechendi requested a review from bigPYJ1151 bigPYJ1151 8 days ago
xuechendi xuechendi requested a review from NickLucche NickLucche 8 days ago
xuechendi xuechendi requested a review from ApostaC ApostaC 8 days ago
xuechendi xuechendi requested a review from orozery orozery 8 days ago
mergify mergify added intel-gpu
mergify mergify added cpu
mergify mergify added kv-connector
xuechendi
gemini-code-assist
gemini-code-assist commented on 2026-04-03
xuechendi Add a skip for HMA
236f573d
xuechendi Move n,h,d fetch to platform pack_kv_cache
ce3ca08f
bigPYJ1151
bigPYJ1151 approved these changes on 2026-04-07
Spycsh
bigPYJ1151 bigPYJ1151 added ready
xuechendi Merge remote-tracking branch 'origin/main' into heter_pd_with_cpu_att…
45367352
xuechendi Fix UT
260a3acd
mergify mergify added v1
bigPYJ1151 bigPYJ1151 merged ef5a2268 into main 2 days ago
NickLucche
NickLucche commented on 2026-04-09

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone