DeepSpeed
fd2f970b - Transformer-kernel - supporting any arbitrary sequence-length (#587)

Commit
5 years ago
Transformer-kernel - supporting any arbitrary sequence-length (#587)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>