Fix half-float conversion ops to handle tensors larger than 2B of params (#17952)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17952
As desc.
Reviewed By: hyuen
Differential Revision: D14435092
fbshipit-source-id: dc614ba16ad531101d04d01aec8f1fbd534ebec5