transformers
Fix Causality Handling in Flash Attention to Support Bidirectional Attention
#39707
Merged

Loading