vllm
[Bugfix] Fix TP inference for Flex attention backend
#19657
Merged

[Bugfix] Fix TP inference for Flex attention backend #19657

Isotr0py merged 4 commits into vllm-project:main from Isotr0py:flex-tp
Isotr0py
Isotr0py fix tp with flex attn
67199afd
Isotr0py disable compile for tp
55d41b71
Isotr0py Isotr0py requested a review from WoosukKwon WoosukKwon 209 days ago
Isotr0py Isotr0py requested a review from robertgshaw2-redhat robertgshaw2-redhat 209 days ago
Isotr0py Isotr0py requested a review from njhill njhill 209 days ago
Isotr0py Isotr0py requested a review from ywang96 ywang96 209 days ago
Isotr0py Isotr0py requested a review from comaniac comaniac 209 days ago
Isotr0py Isotr0py requested a review from alexm-redhat alexm-redhat 209 days ago
github-actions
gemini-code-assist
gemini-code-assist commented on 2025-06-15
mergify mergify added v1
mergify mergify added tpu
gemini-code-assist
gemini-code-assist commented on 2025-06-15
Isotr0py Isotr0py requested a review from houseroad houseroad 209 days ago
houseroad
houseroad commented on 2025-06-15
Isotr0py add engine core tp test
b4f110f6
Isotr0py check cpu num blocks as well
30b4f921
houseroad
houseroad approved these changes on 2025-06-15
Isotr0py Isotr0py enabled auto-merge (squash) 208 days ago
github-actions github-actions added ready
Isotr0py Isotr0py merged 1173804d into main 208 days ago
Isotr0py Isotr0py deleted the flex-tp branch 208 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone