CUDA BFloat16 unary ops part 2 (#44824)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44824
Reviewed By: mruberry
Differential Revision: D23752360
Pulled By: ngimel
fbshipit-source-id: 3aadaf9db9d4e4937aa38671e8589ecbeece709d