[inductor][easy] use num_stages=1 for reduction (#98524)
Since num_stages only matters for matmul kernels and has no effect on pointwise or reduction kernels, this PR sets num_stages to 1 uniformly for all reductions.
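
A minimal sketch of the idea, not the actual inductor code: `KernelConfig` and `reduction_config` are hypothetical stand-ins (the real change builds Triton configs), and the block sizes are illustrative only. The point is that every reduction config carries num_stages=1 regardless of its block sizes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KernelConfig:
    """Hypothetical stand-in for a Triton launch config (not the real triton.Config)."""

    kwargs: dict
    num_warps: int = 4
    num_stages: int = 1


def reduction_config(xblock: int, rblock: int, num_warps: int = 4) -> KernelConfig:
    """Build a config for a reduction kernel (hypothetical helper).

    num_stages is pinned to 1 here instead of being chosen per config.
    """
    return KernelConfig(
        {"XBLOCK": xblock, "RBLOCK": rblock},
        num_warps=num_warps,
        num_stages=1,
    )


# Illustrative block sizes; every config ends up with num_stages == 1.
configs = [
    reduction_config(1, 2048),
    reduction_config(64, 64),
    reduction_config(8, 512),
]
```

Pinning the value in one helper keeps the reduction configs uniform, so no individual config needs to choose its own num_stages.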
Pull Request resolved: https://github.com/pytorch/pytorch/pull/98524
Approved by: https://github.com/ngimel