[`FA2`] Add flash attention for `GPT-Neo` (#26486)
* added Flash Attention 2 for GPT-Neo (see the usage sketch after the commit list)
* small change
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* updated the README
* changes
* removed the `padding_mask` argument
* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
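A minimal usage sketch, not part of the diff itself: it assumes a supported CUDA GPU with the `flash-attn` 2 package installed, and uses the `use_flash_attention_2` flag that `from_pretrained` accepted around the time this PR landed (newer transformers releases spell it `attn_implementation="flash_attention_2"`). The checkpoint and prompts are illustrative.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EleutherAI/gpt-neo-1.3B"  # any GPT-Neo checkpoint should work

tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token  # GPT-Neo ships without a pad token
tokenizer.padding_side = "left"  # left-pad for decoder-only generation

# Flash Attention 2 requires fp16/bf16 weights and a supported GPU.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    use_flash_attention_2=True,  # newer API: attn_implementation="flash_attention_2"
).to("cuda")

# Padding information travels through the standard `attention_mask`;
# no separate `padding_mask` argument is passed (this PR removed it).
inputs = tokenizer(
    ["Flash attention makes long prompts", "GPT-Neo is"],
    return_tensors="pt",
    padding=True,
).to("cuda")

out = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```

With padded batches, the Flash Attention 2 path derives padding from the attention mask internally, so the call site stays identical to the default attention path.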
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>