Fix bug in gpt2's (from-scratch) special scaled weight initialization #17877
only special scale init each gpt2 c_proj weight once, on exact match
0637d69e
fix double quotes
0d9b891d
sgugger
approved these changes
on 2022-06-25
sgugger
merged
e02037b3
into main 3 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub