enable data type propagation (#98065)
Enable data type propagation in schedule node level.
Propagation policy:
(1) ops with dtype args [constant, load, rand, randn] -> direct use dtype as node dtype
(2) ops semantics decide output dtype -> using output dtype
All `override_return_dtype` in https://github.com/pytorch/pytorch/blob/master/torch/_inductor/lowering.py.
(3) other ops: perform promote on input nodes dtype. ADD(BF16, FP32) -> FP32 output.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/98065
Approved by: https://github.com/EikanWang, https://github.com/jansel, https://github.com/jgong5