preserve output tensor's stride in TensorIterator's fast setup (#38895)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38895
Test Plan: Imported from OSS
Differential Revision: D21696586
Pulled By: glaringlee
fbshipit-source-id: c7206dbcf74d30998544e221cd0c998c4c25663a