Avoid using ReduceOp->output_args() in rfactor (#52177)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52177
I'm trying to get rid of `output_args` for reductions, because they
shouldn't be necessary; it's reducing over its reduction axis, why
does it need to know where its output is going?
Rfactor is probably the trickiest place where we use output_args, but
it looks like it's mostly just carrying around the location of the
store, so use that instead.
ghstack-source-id: 121813072
Test Plan:
build/bin/test_tensorexpr && build/bin/tensorexpr_bench
Imported from OSS
Reviewed By: navahgar
Differential Revision: D26420548
fbshipit-source-id: aeab564c6113fa02eabb14c9b70c7edfd05b264d