transformers
35fe3419
- Fix flashattn wrt quantized models (#43145)
Commit
6 days ago
Fix flashattn wrt quantized models (#43145)

* fix regression
* fix
* fix
* fix

---------

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
References
#43145 - Fix flashattn wrt quantized models
Author
SunMarc
Parents
61d7f8a4