transformers
acc968c5
- [CP] Add attention_mask to the buffer when the mask is causal (#40619)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
103 days ago
[CP] Add attention_mask to the buffer when the mask is causal (#40619) Fix attention mask validation for context parallelism Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
References
#40619 - [CP] Add attention_mask to the buffer when the mask is causal
Author
kashif
Parents
cb54ce4e
Loading