transformers
22f888b3 - [mistral] Fix FA2 attention reshape for Mistral Nemo (#32065)

Commit
1 year ago
[mistral] Fix FA2 attention reshape for Mistral Nemo (#32065)

* [mistral] Fix FA2 attention reshape
* [run-slow] mistral