Helper function to wrap entry HLO (#3920)
* Helper function to wrap entry HLO
* Use the wrapped computation during computation
* don't tuple the wrapped hlo's result
* clean up
* preserve aliasing
* Add XLA_PARAMETER_WRAPPING_THREADSHOLD
* code refactor
* Add debug VLOG
* Add test
* clean up
* clean up
* address review comments