transformers @ b5db8ca6 - Add flash attention for `gpt_bigcode` (#26479)

* added flash attention of gpt_bigcode
* changed docs
* Update src/transformers/models/gpt_bigcode/modeling_gpt_bigcode.py
* add FA-2 docs
* oops
* Update docs/source/en/perf_infer_gpu_one.md

  Last Nit

  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* oops
* remove padding_mask
* change getattr->hasattr logic
* changed .md file

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
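For context, a minimal usage sketch (not part of the commit itself): with Flash Attention 2 support added to `gpt_bigcode`, it can be enabled when loading a GPTBigCode checkpoint. The checkpoint name below is illustrative, and the exact keyword depends on the transformers release: releases around the time of this commit used `use_flash_attention_2=True`, while later releases use `attn_implementation="flash_attention_2"`. A CUDA GPU, fp16/bf16 weights, and an installed `flash-attn` package are assumed.

```python
# Minimal sketch: load a GPTBigCode model with Flash Attention 2 enabled.
# Assumes `flash-attn` is installed and a CUDA GPU is available; the
# checkpoint name is only an example of a GPTBigCode model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bigcode/gpt_bigcode-santacoder"  # illustrative GPTBigCode checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,               # FA-2 requires fp16 or bf16
    attn_implementation="flash_attention_2",  # older releases: use_flash_attention_2=True
    device_map="auto",
)

inputs = tokenizer("def fibonacci(n):", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```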