Move the CUDA implementation of round to ATen. (#25041)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25041
Fix #24617
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25041
Test Plan: Imported from OSS
Differential Revision: D17114368
Pulled By: VitalyFedyunin
fbshipit-source-id: 6ec6ef99b4451acd7e93491fd4b44fca9ce1809d