[`GPT2`] Add SDPA support #31172
vasqu
force pushed
1 year ago
vasqu
force pushed
1 year ago
vasqu
force pushed
1 year ago
`gpt2` sdpa support
3dc08bd3
vasqu
force pushed
to
3dc08bd3
1 year ago
fix (at least) one test, style, repo consistency
e425b89c
fix sdpa mask in forward --> fixes generation
ad6c985d
test
9c729a34
test2
322fb615
test3
bb30edfa
test4
3953454d
simplify shapes for attn mask creation and small comments
d963ad5d
vasqu
changed the title `GPT2` Add SDPA support [`GPT2`] Add SDPA support 1 year ago
hub fail test
91fe5338
benchmarks
4a7b1664
flash attn 2 mask should not be inverted on enc-dec setup
f0d7d2ac
fix comment
3c12ee06
apply some suggestion from code review
95b2440f
change elif logic
c6651165
[run-slow] gpt2
4811cb55
modify `test_gpt2_sample_max_time` to follow previous assertion patterns
8b33cd79
vasqu
commented
on 2024-06-07
vasqu
deleted the gpt2-sdpa branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub