Fix -1e4 as attn mask #17306
ydshieh
changed the title from "[WIP] Fix -1e4 as attn mask" to "Fix -1e4 as attn mask" 4 years ago
ydshieh
marked this pull request as ready for review 4 years ago
sgugger
approved these changes on 2022-05-25
ydshieh
marked this pull request as draft 4 years ago
ydshieh
marked this pull request as ready for review 3 years ago
Use torch.finfo(self.dtype).min
0322ea5e
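The idea named in this commit message can be sketched as follows. This is a minimal illustration, not the code from the PR: the helper name `extend_attention_mask` and the tensor shapes are assumptions made for the example. It builds an additive attention mask whose masked positions hold the most negative finite value of the working dtype, rather than a hard-coded constant such as `-1e4`.

```python
import torch


def extend_attention_mask(attention_mask: torch.Tensor, dtype: torch.dtype) -> torch.Tensor:
    """Hypothetical helper: turn a 1/0 padding mask into an additive mask."""
    # (batch, seq_len) -> (batch, 1, 1, seq_len), so it broadcasts over
    # attention heads and query positions.
    extended = attention_mask[:, None, None, :].to(dtype)
    # 1 (keep) -> 0.0, 0 (pad) -> most negative finite value of `dtype`.
    return (1.0 - extended) * torch.finfo(dtype).min


mask = torch.tensor([[1, 1, 0]])  # last token is padding
additive = extend_attention_mask(mask, torch.float16)

# Apply the mask to dummy attention scores (softmax done in float32 here).
scores = torch.zeros(1, 1, 3, 3, dtype=torch.float32)
probs = torch.softmax(scores + additive.float(), dim=-1)
```

Because the fill value comes from `torch.finfo(dtype)`, it stays finite and representable whether the model runs in `float16`, `bfloat16`, or `float32`, and the padded position receives zero attention weight after the softmax.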
for GPTNeoX
eda30548
for Albert
78cb386d
For Splinter
5aaa4f93
Update src/transformers/models/data2vec/modeling_data2vec_audio.py
47cc6af5
better style
1cc0b232
fix -inf used in Bart-like models
3861fe04
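One likely reason for replacing `-inf` (an assumption, since this timeline does not state the motivation) is that a softmax over a row in which every position is `-inf` yields NaN, while a large finite fill value does not. A small sketch with made-up tensors:

```python
import torch

scores = torch.zeros(1, 4)                  # one query row, four key positions
keep = torch.zeros(1, 4, dtype=torch.bool)  # no key is kept: the row is fully masked

# Fill every masked position with -inf versus the dtype's most negative finite value.
with_inf = scores.masked_fill(~keep, float("-inf"))
with_min = scores.masked_fill(~keep, torch.finfo(scores.dtype).min)

probs_inf = torch.softmax(with_inf, dim=-1)  # every entry is NaN
probs_min = torch.softmax(with_min, dim=-1)  # finite, uniform over the row
```

With the finite fill, a fully masked row degrades to a uniform distribution instead of propagating NaN through the rest of the network.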
Fix a few remaining -inf
217a1df6
more fix
2e5c050b
clean up
51a577d1
For CLIP
3201a0f1
For FSMT
3da95f95
clean up
b018a517
fix test
4faab4ae
Add dtype argument and use it for LayoutLMv3
a2b419c1
fix
bcf7536a
fix
73b5e654
fix conflict
5dc2a3f7
ydshieh
force pushed to 5dc2a3f7 3 years ago
update FlaxLongT5Attention
25eac912
ydshieh
merged d3cb2888 into main 3 years ago