[ROCm][AITER] Fix aiter paged_attention_v1 decode for sliding window and head_size < 64 #34570
[ROCm][AITER] Fix ROCm AITER FA attn backend decode fallback for slid…
b4f814f5
Fix off-by-one in unified_attention cu_seqlens_q and descale_shape
002d0eb6
Merge remote-tracking branch 'origin/main' into akaratza_fix_aiter_fa
fe6b5c96
Guard unified_attention on head_size only; restore sliding window arg…
f4f23b13
Merge remote-tracking branch 'origin/main' into akaratza_fix_aiter_fa
b257acbc
Merge remote-tracking branch 'origin/main' into akaratza_fix_aiter_fa
c09f5445
Merge remote-tracking branch 'origin/main' into akaratza_fix_aiter_fa
f327c2fb
Merge remote-tracking branch 'origin/main' into akaratza_fix_aiter_fa
ddac40d9
vllm-bot
merged
cf93c1a1
into main 86 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub