Refactor (and fix) gpt_neox (#35610)
* start a nice modular
* Update modular_gpt_neox.py
* Update modular_gpt_neox.py
* Update modular_gpt_neox.py
* Update modular_gpt_neox.py
* update
* Update modular_gpt_neox.py
* convert
* fix attribute
* fix attrs
* oups
* fix
* fix
* fix
* fix
* fix
* fix order to pass test (see with accelerate team)
* trigger CIs
* modular
* update
* up
* Update test_modeling_gpt_neox.py
* Update test_modeling_gpt_neox.py
* trigger CIs
* correctly pass arg
* simplify
* remove key warning
* update tp -> it's compatible since the view is before
* trigger CIs