Add FastGelu to kernel explorer for profiling. (#11995)
* Add FastGelu to kernel explorer for profiling.
* Fix Python lint errors
* Fix one more Python lint error
* Delete whitespace (Python lint)
* Various improvements.
* Update README.md
* Refactor header files