transformers
Fix bug in gpt2's (from-scratch) special scaled weight initialization
#17877
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
2
Changes
View On
GitHub
Loading