transformers
c21e1071
- [deepspeed / m2m_100] make deepspeed zero-3 work with layerdrop (#16717)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
3 years ago
[deepspeed / m2m_100] make deepspeed zero-3 work with layerdrop (#16717) * [deepspeed / m2m_100] make deepspeed 3 work with layerdrop * fix * revert last
References
#27720 - Add common processor tests
#32831 - [Docs] Update resources
#29969 - [SigLIP] Add fast tokenizer
#38622 - [AutoModelForMaskGeneration] Remove duplicate code
#33111 - [Backbone] Remove out_features everywhere
#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs
#19449 - [WIP] Fix weights initialization of several vision models
#16717 - [deepspeed / m2m_100] make deepspeed zero-3 work with layerdrop
Author
stas00
Parents
89293a0f
Files
1
src/transformers/models/m2m_100
modeling_m2m_100.py
Loading