Always synchronize src and dst streams when copying tensors (#16966)
Summary:
Fixes #15568.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16966
Differential Revision: D14213144
Pulled By: mrshenli
fbshipit-source-id: 2fcf5e07895fde80b4aee72e2736b0def876d21f