transformers
7f552e28 - Gemma2 and flash-attention (#32188)

1 year ago
Gemma2 and flash-attention (#32188)

* enable flash-attn & static cache
* this works, not the prev
* fix for sliding window layers
* not needed anymore