Revert D22011184: [pytorch][PR] Fix CUDA device guard usage when first arg of kernel is scalar
Test Plan: revert-hammer
Differential Revision: D22011184
Original commit changeset: 427291c456e8
fbshipit-source-id: 7d4979e98bbd9294b91da255ecfc063615741630