Add channels last support to cuda.comm.scatter and gather
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28077
Test Plan: Imported from OSS
Differential Revision: D17980305
Pulled By: VitalyFedyunin
fbshipit-source-id: e4741194baac3d93f2d53724582dc4c38f82ee84