DeepSpeed
5208eb73 - Add Unidirectional Sparse Attention Type to BigBird and BSLongformer (#1959)

Commit
3 years ago
Add Unidirectional Sparse Attention Type to BigBird and BSLongformer (#1959)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>