Ported gelu decomp to ref (#78697)
These decomps are painful to write without operator overloading, so this PR just uses operator overloading and xfails the ref tests for now.
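
For context, here is a minimal sketch of why operator overloading matters for this kind of decomposition. It is not the code from this PR: it works on plain Python floats, and the names `gelu_overloaded` and `gelu_explicit` are hypothetical, with `operator.mul` / `operator.add` standing in for explicit elementwise calls. The formulas are the standard erf form of GELU and its tanh approximation.

```python
import math
import operator


def gelu_overloaded(x: float, approximate: str = "none") -> float:
    """GELU written with ordinary arithmetic operators (illustrative only)."""
    if approximate == "tanh":
        # tanh approximation: 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
        inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x * x * x)
        return 0.5 * x * (1.0 + math.tanh(inner))
    # erf form: 0.5 * x * (1 + erf(x / sqrt(2)))
    return 0.5 * x * (1.0 + math.erf(x * math.sqrt(0.5)))


def gelu_explicit(x: float) -> float:
    """The same erf form with every operation spelled as a function call.

    operator.mul / operator.add are stand-ins (an assumption for this sketch)
    for explicit elementwise calls used when operators are not available.
    """
    mul, add = operator.mul, operator.add
    return mul(mul(0.5, x), add(1.0, math.erf(mul(x, math.sqrt(0.5)))))
```

Both functions compute the same value; the second only shows how quickly the nesting grows once each `*` and `+` has to become a call.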
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78697
Approved by: https://github.com/mruberry