[CP] Add attention_mask to the buffer when the mask is causal #40619
Fix attention mask validation for context parallelism
0dd1b041
SunMarc
approved these changes
on 2025-09-02
Merge branch 'main' into kashif-patch-1
9a8ae937
SunMarc
enabled auto-merge (squash) 134 days ago
only split 2d attention masks
484321d7
SunMarc
merged
acc968c5
into main 133 days ago
SunMarc
deleted the kashif-patch-1 branch 133 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub