DeepSpeed
fix: correct DistributedAttention output shape and pad uneven sequence lengths (#7842)
#7868
Open

fix: correct DistributedAttention output shape and pad uneven sequence lengths (#7842) #7868

harshang03
harshang03 fix: correct DistributedAttention output shape and pad uneven sequenc…
aa22cbbb
PKUWZP PKUWZP requested a review from PKUWZP PKUWZP 78 days ago
PKUWZP
PKUWZP requested changes on 2026-02-22

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone