transformers
46df8599 - [`GPTNeoX`] Flex Attention + Refactor (#34896)

Commit

1 year ago

[`GPTNeoX`] Flex Attention + Refactor (#34896) * gpt neox flex attention + refactor * some formatting * small fix on dropout * add assertion on flex attn test * flaky ci :( * add head mask support * style * handle dtype, replace torch where * fixup flex with output attns * code review and several other fixes * Update src/transformers/modeling_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * style * remove unnecessary comment * remove incorrect comment * make flex attn check more agnostic tor versions and centralized * change peft input dtype check to value since q and k could be affected by other stuff like RoPE * i forgor * flaky * code review and small fixes * Update src/transformers/models/gpt_neox/modeling_gpt_neox.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

References

#34896 - [`GPTNeoX`] Flex Attention + Refactor

Author

vasqu

Parents

accb7204

transformers 46df8599 - [`GPTNeoX`] Flex Attention + Refactor (#34896)

transformers
46df8599 - [`GPTNeoX`] Flex Attention + Refactor (#34896)