[Bugfix] Fix TP inference for Flex attention backend #19657
fix tp with flex attn
67199afd
disable compile for tp
55d41b71
add engine core tp test
b4f110f6
check cpu num blocks as well
30b4f921
houseroad
approved these changes
on 2025-06-15
Isotr0py
enabled auto-merge (squash) 208 days ago
Isotr0py
merged
1173804d
into main 208 days ago
Isotr0py
deleted the flex-tp branch 208 days ago
Assignees
No one assigned