xla
83f186d0 - [Functionalization] Slightly improve detach_copy (#4814)

Commit
2 years ago
[Functionalization] Slightly improve detach_copy (#4814) Summary: Somehow the current detach_copy has increased the memory usage of GPT-2 with FSDP a lot, see #4813. We may not implement it correctly. This fix won't fix the memory overhead as well. Test Plan: CI.
Author
Parents
Loading