transformers
Add a use_parallel_residual argument to control the residual computing way
#18695
Merged

Add a use_parallel_residual argument to control the residual computing way #18695

sgugger merged 3 commits into huggingface:main from main
NinedayWang
HuggingFaceDocBuilderDev
NinedayWang NinedayWang marked this pull request as draft 3 years ago
NinedayWang NinedayWang marked this pull request as ready for review 3 years ago
urialon
patrickvonplaten
VHellendoorn
patrickvonplaten
VHellendoorn
patrickvonplaten
VHellendoorn
sgugger
NinedayWang
patrickvonplaten
NinedayWang
patrickvonplaten
sgugger
sgugger commented on 2022-09-14
NinedayWang Add a gpt_j_residual argument to control the residual computing way
328e0643
NinedayWang Put duplicate code outside of the if block
70aaec5c
NinedayWang Rename parameter "gpt_j_residual" to "use_parallel_residual" and set …
4c12b692
NinedayWang
patrickvonplaten
patrickvonplaten commented on 2022-09-27
patrickvonplaten
patrickvonplaten commented on 2022-09-27
patrickvonplaten patrickvonplaten changed the title Add a gpt_j_residual argument to control the residual computing way Add a use_parallel_residual argument to control the residual computing way 3 years ago
patrickvonplaten
patrickvonplaten approved these changes on 2022-09-27
sgugger
sgugger approved these changes on 2022-09-27
sgugger sgugger merged 226b0e46 into main 3 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone