diffusers
47455bd1 - Fix Flash Attention 3 interface for new FA3 return format (#13173)

Commit

129 days ago

Fix Flash Attention 3 interface for new FA3 return format (#13173) * Fix Flash Attention 3 interface compatibility for new FA3 versions Newer versions of flash-attn (after Dao-AILab/flash-attention@ed20940) no longer return lse by default from flash_attn_3_func. The function now returns just the output tensor unless return_attn_probs=True is passed. Updated _wrapped_flash_attn_3 and _flash_varlen_attention_3 to pass return_attn_probs and handle both old (always tuple) and new (tensor or tuple) return formats gracefully. Fixes #12022 * Simplify _wrapped_flash_attn_3 return unpacking Since return_attn_probs=True is always passed, the result is guaranteed to be a tuple. Remove the unnecessary isinstance guard.

References

#13173 - Fix Flash Attention 3 interface for new FA3 return format

Author

veeceey

Parents

97c2c6e3

diffusers 47455bd1 - Fix Flash Attention 3 interface for new FA3 return format (#13173)

diffusers
47455bd1 - Fix Flash Attention 3 interface for new FA3 return format (#13173)