transformers
[CP] Add attention_mask to the buffer when the mask is causal
#40619
Merged

SunMarc merged 3 commits into main from kashif-patch-1
kashif Fix attention mask validation for context parallelism
0dd1b041
kashif requested a review from SunMarc 134 days ago
SunMarc approved these changes on 2025-09-02
SunMarc Merge branch 'main' into kashif-patch-1
9a8ae937
SunMarc enabled auto-merge (squash) 134 days ago
kashif only split 2d attention masks
484321d7
SunMarc merged acc968c5 into main 133 days ago
SunMarc deleted the kashif-patch-1 branch 133 days ago

