Fix XAttention BY_TOKEN KV-cache normalization #35183
Fix XAttention BY_TOKEN KV-cache normalization
370ede71
WeldonWangwang
marked this pull request as ready for review 55 days ago
Add test case
9220f968
Simplify the conditions for changing the precision of the kv cache
093ddf43
Merge branch 'master' into ww/fix_xattn_by_token
5195fb5e
Fix xattention kv-cache transform test for 28-input PagedAttentionExt…
d74b27df
Merge branch 'master' into ww/fix_xattn_by_token
3ce19599
e-ddykim
approved these changes
on 2026-04-17
e-ddykim
merged
89c806e2
into master 46 days ago