[Flash Attention 2] Add flash attention 2 for GPT-J #28295
initial implementation of flash attention for gptj (6419b041)
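For context, a minimal sketch (not the PR's code) of what replacing the eager attention math with the fused kernel looks like, assuming the `flash-attn` package's `flash_attn_func` and its `(batch, seq_len, num_heads, head_dim)` tensor layout:

```python
from flash_attn import flash_attn_func

def flash_forward(query, key, value, dropout_p=0.0):
    # query/key/value: (batch, seq_len, num_heads, head_dim), fp16/bf16 on CUDA.
    # GPT-J is a causal decoder, so masking is handled inside the fused kernel
    # instead of materializing the full attention-bias matrix.
    return flash_attn_func(query, key, value, dropout_p=dropout_p, causal=True)
```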
modify flash attention and overwrite test_flash_attn_2_generate_paddi… (3a9e31f9)
update flash attention support list (cb656559)
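Flash attention is opted into per architecture in transformers; a sketch of the class-level flag the library checks before dispatching (the docs support list is updated alongside):

```python
from transformers import PreTrainedModel

class GPTJPreTrainedModel(PreTrainedModel):
    # Class-level opt-in; without it, loading with
    # attn_implementation="flash_attention_2" errors out at load time.
    _supports_flash_attn_2 = True
```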
remove the copy line in the `CodeGenBlock` (e47ef133)
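The `# Copied from` markers are how `make fix-copies` keeps mirrored modules in sync; something like the marker below had to be dropped so the flash-attention changes to `GPTJBlock` would not be forced onto `CodeGenBlock`:

```python
from torch import nn

# Before: this marker tells `make fix-copies` to keep CodeGenBlock identical
# to GPTJBlock, overwriting any divergence.
# Copied from transformers.models.gptj.modeling_gptj.GPTJBlock with GPTJ->CodeGen
class CodeGenBlock(nn.Module):
    ...

# After: with the marker removed, CodeGenBlock is detached and the
# flash-attention-aware GPTJBlock no longer propagates into CodeGen.
```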
bytebarde force-pushed to e47ef133 (2 years ago)
bytebarde changed the title from "[Flash Attention 2] [WIP] Add flash attention 2 for GPT-J" to "[Flash Attention 2] Add flash attention 2 for GPT-J" (2 years ago)
address copy mechanism (0c31cb3f)
Update src/transformers/models/gptj/modeling_gptj.py (def626ef)
Merge branch 'huggingface:main' into flash_attn_gptj (6e9c7070)
Add GPTJ attention classes (af0752ea)
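A sketch of the dispatch pattern this follows, with stubbed-out classes for illustration; the real implementations live in `modeling_gptj.py`:

```python
from torch import nn

class GPTJAttention(nn.Module):            # eager math path (stub here)
    def __init__(self, config):
        super().__init__()

class GPTJFlashAttention2(GPTJAttention):  # flash-attn path (stub here)
    pass

# Per-model dispatch dict: the block looks up the implementation named on
# the config, so adding a backend is one subclass plus one entry.
GPTJ_ATTENTION_CLASSES = {
    "eager": GPTJAttention,
    "flash_attention_2": GPTJFlashAttention2,
}

class GPTJBlock(nn.Module):
    def __init__(self, config):
        super().__init__()
        self.attn = GPTJ_ATTENTION_CLASSES[config._attn_implementation](config)
```

Keeping the selection in a dict leaves the eager path untouched, so new backends are purely additive.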
Merge branch 'huggingface:main' into flash_attn_gptj (cd73e337)
add expected outputs in the gptj test (cb265c73)
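A hypothetical sketch of the shape of such an integration test; the prompt, token budget, and pinned strings below are placeholders, not the PR's actual values:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def test_gptj_flash_attn_generation():
    tok = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6b")
    model = AutoModelForCausalLM.from_pretrained(
        "EleutherAI/gpt-j-6b",
        torch_dtype=torch.float16,
        attn_implementation="flash_attention_2",
    ).to("cuda")
    inputs = tok("Hello, my dog is", return_tensors="pt").to("cuda")
    # Greedy decoding keeps the output deterministic, so it can be pinned.
    out = model.generate(**inputs, max_new_tokens=8, do_sample=False)
    EXPECTED = ["..."]  # placeholder; the real strings are pinned in the test
    assert tok.batch_decode(out, skip_special_tokens=True) == EXPECTED
```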
Ensure repo consistency with 'make fix-copies' (2b489b06)