transformers
Flash Attention 2 support for ROCm
#27611
Merged
fxmarty merged 45 commits into huggingface:main from fxmarty:flash-attn-2-support-rocm
support FA2
639e22b0
fix typo
100a464e
fix broken tests
0279b14e
fix more test errors
f449e86e
left/right
cd076a1b
fix bug
40d6a95b
more test
dc192b75
typo
2ed4492c
fix layout flash attention falcon
0cf3fec7
do not support this case
414dc77b
use allclose instead of equal
333c3e60
fxmarty commented on 2023-11-20
fxmarty commented on 2023-11-20
fix various bugs with flash attention
26f0455e
bump
f0dd02c7
fix test
278ef41f
fix mistral
321bf4bd
use skiptest instead of return that may be misleading
4712b483
Merge branch 'fix-bugs-flash' into flash-attn-2-support-rocm
97b25f68
add fix causal arg flash attention
a6b1633e
ArthurZucker commented on 2023-11-21
fix copies
669f545a
more explicit comment
2df227ae
still use self.is_causal
16b73d78
fix causal argument
685c7ce6
comment
9c8c6f75
Merge branch 'main' into flash-attn-2-support-rocm
851343fc
fixes
7dfe765a
update documentation
3ab0d309
Merge branch 'flash-attn-2-support-rocm' of https://github.com/fxmart…
1f6ae8a4
add link
bbc1f5aa
wrong test
d0869c5c
fxmarty marked this pull request as ready for review 2 years ago
fxmarty requested a review from mfuntowicz 2 years ago
fxmarty requested a review from ArthurZucker 2 years ago
fxmarty requested a review from amyeroberts 2 years ago
fxmarty requested a review from younesbelkada 2 years ago
simplify FA2 RoCm requirements
e76188c5
Merge branch 'flash-attn-2-support-rocm' of https://github.com/fxmart…
52e83f9e
Merge branch 'main' into flash-attn-2-support-rocm
ae7f4864
update opt
e4a18852
mfuntowicz approved these changes on 2023-11-23
LysandreJik approved these changes on 2023-11-24
younesbelkada commented on 2023-11-24
make flash_attn_uses_top_left_mask attribute private and precise comment
ce6e1473
better error handling
5a603c09
ArthurZucker approved these changes on 2023-11-24
amyeroberts approved these changes on 2023-11-24
younesbelkada approved these changes on 2023-11-24
Merge branch 'main' into flash-attn-2-support-rocm
c4809e6b
fix copy & mistral
6cf3bb7c
Update src/transformers/modeling_utils.py
8ddbb298
Update src/transformers/modeling_utils.py
55b36e42
Update src/transformers/modeling_utils.py
9d0289b5
Update src/transformers/utils/import_utils.py
60b3163c
use is_flash_attn_greater_or_equal_2_10 instead of is_flash_attn_grea…
57025a4c
fix merge
87ac9499
simplify
733b9509
inline args
3926b06d
fxmarty merged 1da1302e into main 2 years ago
Reviewers
ArthurZucker
LysandreJik
mfuntowicz
younesbelkada
amyeroberts
Assignees
No one assigned
Labels
None yet
Milestone
No milestone