DeepSpeed
542b25c4 - fixing the sparse attention for low block-size

Commit
4 years ago
fixing the sparse attention for low block-size
Author
Reza Yazdani
Parents
Loading