transformers
fb0a38b4 - Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)

Commit
2 years ago
Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)
Author
Parents
Loading