DeepSpeed
2466fd9d
- packed flash attn with mask works
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
packed flash attn with mask works
References
#4337 - adds triton flash attention2 kernel
Author
styoun
Parents
95456d0e
Loading