DeepSpeed
e193901a - Fix for fragmented linear inputs in ZeRO 3 Linear layers where reshape is needed instead of view. Should not affect performance for cases which only requires view since reshape will just do view when possible

Commit
4 years ago
Fix for fragmented linear inputs in ZeRO 3 Linear layers where reshape is needed instead of view. Should not affect performance for cases which only requires view since reshape will just do view when possible
Author
Parents
Loading