transformers
0637d69e - only special scale init each gpt2 c_proj weight once, on exact match

Commit
3 years ago