DeepSpeed
be46ff6d - Add purely-local sliding window sparse attention config (#1962)

Commit
3 years ago
Add purely-local sliding window sparse attention config (#1962)

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>