openvino
Fix XAttention BY_TOKEN KV-cache normalization
#35183
Merged

Fix XAttention BY_TOKEN KV-cache normalization #35183

WeldonWangwang
github-actions github-actions added category: GPU
WeldonWangwang WeldonWangwang requested a review from peterchen-intel peterchen-intel 56 days ago
WeldonWangwang Fix XAttention BY_TOKEN KV-cache normalization
370ede71
WeldonWangwang WeldonWangwang marked this pull request as ready for review 55 days ago
WeldonWangwang WeldonWangwang requested a review 55 days ago
WeldonWangwang WeldonWangwang requested a review 55 days ago
WeldonWangwang Add test case
9220f968
peterchen-intel peterchen-intel requested a review from ceciliapeng2011 ceciliapeng2011 54 days ago
peterchen-intel peterchen-intel requested a review from riverlijunjie riverlijunjie 54 days ago
WeldonWangwang Simplify the conditions for changing the precision of the kv cache
093ddf43
ceciliapeng2011
ceciliapeng2011 approved these changes on 2026-04-14
riverlijunjie
riverlijunjie approved these changes on 2026-04-14
peterchen-intel peterchen-intel assigned e-ddykim e-ddykim 48 days ago
peterchen-intel Merge branch 'master' into ww/fix_xattn_by_token
5195fb5e
WeldonWangwang Fix xattention kv-cache transform test for 28-input PagedAttentionExt…
d74b27df
WeldonWangwang Merge branch 'master' into ww/fix_xattn_by_token
3ce19599
e-ddykim
e-ddykim approved these changes on 2026-04-17
e-ddykim e-ddykim enabled auto-merge 46 days ago
e-ddykim e-ddykim merged 89c806e2 into master 46 days ago

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone