Re-enable SDPA's FA2 path (#30070)
* tentatively re-enable FA2 + SDPA
* better comment
* _ignore_causal_mask_sdpa as staticmethod
* type hints
* use past_seen_tokens instead
* enable copied from for sdpa
* ruff
* llama simplifications on review
* remove unnecessary self.is_causal check
* fix copies
* cleaning
* precise message
* better doc
* add test
* simplify
* Update src/transformers/models/llama/modeling_llama.py
* Update src/transformers/models/llama/modeling_llama.py
* Update src/transformers/models/llama/modeling_llama.py
* style
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>