avoid kernel launches for zero-sized tensor inputs
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22790
Test Plan: Imported from OSS
Differential Revision: D16226168
Pulled By: wanchaol
fbshipit-source-id: 081607c9acc1540c753b080c5f727dc4e8c22acc