[PyTorch] Clean up native transformer implementation
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78265
This cleanup prepares the native transformer implementation for norm_first support.
Differential Revision: [D36564011](https://our.internmc.facebook.com/intern/diff/D36564011/)
Approved by: https://github.com/jbschlosser