transformers 5600e6f3 - Hardcode GELU as the intermediate activation for ESM (#22892)
Commit
2 years ago
Hardcode GELU as the intermediate activation for ESM (#22892)
* Hardcode GELU as the intermediate activation for ESM
* Sneak a quick fix to the weight tying in too
* Make the call to gelu explicit
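As a rough illustration of what "hardcoding" the activation and making the call to gelu explicit could look like, here is a minimal pure-Python sketch. It is not the code from this commit: the names `gelu` and `intermediate_forward`, and the list-based weights, are hypothetical and chosen only to keep the example self-contained. It assumes the intermediate layer is a dense projection followed by the exact (erf-based) GELU.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def intermediate_forward(hidden, weight, bias):
    """Dense projection followed by GELU (hypothetical helper).

    hidden: list of floats, one token's hidden state
    weight: list of rows, one row per output unit
    bias:   list of floats, one per output unit
    """
    projected = [
        sum(w * h for w, h in zip(row, hidden)) + b
        for row, b in zip(weight, bias)
    ]
    # The activation is called explicitly here instead of being
    # looked up from a configurable activation name.
    return [gelu(v) for v in projected]


if __name__ == "__main__":
    out = intermediate_forward([1.0, 2.0], [[1.0, 0.0], [0.0, -1.0]], [0.0, 0.0])
    print(out)
```

The point of the sketch is only that the activation no longer depends on a configuration value: whatever the config says, the intermediate output always passes through GELU.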
References
#22892 - Hardcode GELU as the intermediate activation for ESM
#27720 - Add common processor tests
#29969 - [SigLIP] Add fast tokenizer
#32831 - [Docs] Update resources
#33111 - [Backbone] Remove out_features everywhere
#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs
#39821 - Support MetaCLIP 2
#59 - Fix attention mask handling in EoMT-DINOv3 converter
#62 - Add initial DEIMv2 model implementation
#65 - Fix RTDetrV2 sine position embedding ordering
Author
Rocketknight1
Parents
874c7caf