Move the body of fill_kernel_impl into fill_kernel_cuda
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22760
Test Plan: Imported from OSS
Differential Revision: D16257782
Pulled By: ezyang
fbshipit-source-id: d214d2d77affd937109b33ca841af76004f85834