[`Gemma`] final fixes to the modeling #29729
gelu_pytorch_tanh
14645206
Force config.hidden_act to be approx gelu
606463fd
Merge branch 'huggingface:main' into main
16ed142b
Gemma bug fixes
03139e69
Merge branch 'huggingface:main' into main
ca3cae37
Merge branch 'huggingface:main' into main
73c24f61
force_use_exact_gelu
2b8c7f10
Update configuration_gemma.py
c1b8bef8
Update modeling_gemma.py
32656ccf
Merge branch 'huggingface:main' into main
9aa08bf7
update
557c8fc1
update for simpler handling
3d5abdbe
nit
7b3e8e8f
nit
667ca19f
fixpup
98f54a07
update
a54068ea
also update the jax modeling!
496c8073
add `"gelu_pytorch_tanh": partial(nn.gelu, approximate=True),`
9a52bbb0
fixup
840b47f5
fix order
8f0a9771
act vs act_fn
e99ab0b7
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub