transformers
fb0a38b4
- Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)
Commit
2 years ago
Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)
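The ordering described in the commit message can be sketched as follows. This is a minimal illustration, not the code changed by this commit: the helper name `wrap_for_training` and its `use_compile` flag are hypothetical, and DDP stands in for either distributed wrapper. The idea is that the distributed wrapper is applied first and `torch.compile()` is applied to the already-wrapped module, so the compiler sees the distributed wrapper when it decides where to break the graph.

```python
import torch
import torch.nn as nn
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP


def wrap_for_training(model: nn.Module, use_compile: bool = True) -> nn.Module:
    """Hypothetical helper: distributed wrapper first, torch.compile() last."""
    # Step 1: distributed wrapping. DDP is used here for illustration;
    # an FSDP wrapper would occupy the same position in the sequence.
    # DDP is only applied when a process group has been initialized.
    if dist.is_available() and dist.is_initialized():
        model = DDP(model)

    # Step 2: compile the already-wrapped module, so compilation happens
    # on top of the distributed wrapper rather than underneath it.
    if use_compile:
        model = torch.compile(model)

    return model
```

In a single-process run with no process group, the DDP step is skipped and only the optional compile step applies, so the same helper can be called in both distributed and non-distributed setups.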
References
#22279 - Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training
Author
ani300
Parents
8ac29fe0