Fix CUDA launch configuration for replication_pad (#50565)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/49601
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50565
Reviewed By: mruberry
Differential Revision: D25968843
Pulled By: ngimel
fbshipit-source-id: 6d2d543132b501765e69b52caaa283fb816db276