transformers
7f552e28
- Gemma2 and flash-attention (#32188)
Commit
1 year ago
Gemma2 and flash-attention (#32188)
* enable flash-attn & static cache
* this works, not the prev
* fix for sliding window layers
* not needed anymore
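The change enables Gemma2 to run with the flash-attention backend together with a static KV cache, including its sliding-window layers. A minimal sketch of how that combination is exercised from the standard transformers generation API is shown below; the checkpoint name and prompt are placeholders, not part of this commit.

# Sketch: Gemma2 with flash-attention 2 and a static cache (assumed usage).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2-9b"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",  # the attention backend this commit enables for Gemma2
).to("cuda")

inputs = tokenizer("The capital of France is", return_tensors="pt").to("cuda")
# cache_implementation="static" requests the static KV cache that this commit
# makes compatible with flash-attention for Gemma2's sliding-window layers.
outputs = model.generate(**inputs, max_new_tokens=32, cache_implementation="static")
print(tokenizer.decode(outputs[0], skip_special_tokens=True))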
References
#32188 - Gemma2 and flash-attention
Author
zucchini-nlp
Parents
a3264332