transformers
Fix T5Attention shape mismatch under Tensor Parallelism
#45109
Merged

Fix T5Attention shape mismatch under Tensor Parallelism #45109

aws-zhanxun
aws-zhanxun Fix T5Attention shape mismatch under Tensor Parallelism
e9f299e1
Rocketknight1
vasqu
vasqu approved these changes on 2026-03-30
aws-zhanxun Run make fix-repo: propagate TP view fix to copied models
d4f43580
aws-zhanxun Refactor view() calls to use shape tuples per review
8680ba54
vasqu
vasqu approved these changes on 2026-03-31
aws-zhanxun Chain view() calls per reviewer suggestion
89d64b49
github-actions
vasqu
vasqu
github-actions
github-actions
github-actions
github-actions
vasqu vasqu enabled auto-merge 46 days ago
HuggingFaceDocBuilderDev
vasqu vasqu merged d55f0350 into main 46 days ago
vasqu

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone