transformers
[Flash Attention 2] Add flash attention 2 for GPT-J
#28295
Merged

[Flash Attention 2] Add flash attention 2 for GPT-J #28295

bytebarde
bytebarde initial implementation of flash attention for gptj
6419b041
bytebarde
susnato
bytebarde modify flash attention and overwrite test_flash_attn_2_generate_paddi…
3a9e31f9
bytebarde
bytebarde update flash attention support list
cb656559
bytebarde remove the copy line in the `CodeGenBlock`
e47ef133
bytebarde bytebarde force pushed to e47ef133 2 years ago
bytebarde bytebarde changed the title [Flash Attention 2] [WIP] Add flash attention 2 for GPT-J [Flash Attention 2] Add flash attention 2 for GPT-J 2 years ago
bytebarde
bytebarde
bytebarde commented on 2024-01-04
bytebarde
bytebarde commented on 2024-01-04
bytebarde
bytebarde commented on 2024-01-04
bytebarde
bytebarde commented on 2024-01-04
ArthurZucker ArthurZucker requested a review from younesbelkada younesbelkada 2 years ago
younesbelkada
younesbelkada commented on 2024-01-08
bytebarde address copy mechanism
0c31cb3f
bytebarde
younesbelkada
younesbelkada approved these changes on 2024-01-29
younesbelkada younesbelkada requested a review from ArthurZucker ArthurZucker 2 years ago
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker approved these changes on 2024-01-30
younesbelkada Update src/transformers/models/gptj/modeling_gptj.py
def626ef
bytebarde Merge branch 'huggingface:main' into flash_attn_gptj
6e9c7070
younesbelkada
younesbelkada commented on 2024-02-01
bytebarde Add GPTJ attention classes
af0752ea
bytebarde
ArthurZucker
ArthurZucker
ArthurZucker approved these changes on 2024-02-12
github-actions
ArthurZucker
bytebarde Merge branch 'huggingface:main' into flash_attn_gptj
cd73e337
bytebarde add expected outputs in the gptj test
cb265c73
bytebarde
younesbelkada
younesbelkada commented on 2024-03-12
bytebarde Ensure repo consistency with 'make fix-copies'
2b489b06
bytebarde
younesbelkada
younesbelkada approved these changes on 2024-03-13
younesbelkada younesbelkada merged be3fd8a2 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone