Add asymmetric shapes to test sum Triton kernels (#2310)
Summary:
Pull Request resolved: https://github.com/pytorch/benchmark/pull/2310
Add asymmetric shapes to `get_input_iter()` to test the accuracy and performance of `sum` Triton kernel implementations against PyTorch.
This diff generates tensors whose dimensions differ in size. For example, a 2D asymmetric tensor has shape `(n, n + 3)`; a 3D asymmetric tensor has shape `(n, n + 3, n + 6)`.
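
A minimal sketch of the shape pattern described above, for illustration only. The helper name `asymmetric_shape` and its `step` parameter are hypothetical and not taken from the diff; the actual logic in `get_input_iter()` may differ.

```python
from typing import Tuple


def asymmetric_shape(n: int, ndim: int, step: int = 3) -> Tuple[int, ...]:
    """Return a shape whose i-th dimension is n + step * i.

    Hypothetical helper: with the default step of 3 it reproduces the
    (n, n + 3) and (n, n + 3, n + 6) shapes named in the summary.
    """
    if ndim < 1:
        raise ValueError("ndim must be at least 1")
    return tuple(n + step * i for i in range(ndim))


# Example shapes for a base size of 4.
shape_2d = asymmetric_shape(4, 2)
shape_3d = asymmetric_shape(4, 3)
print(shape_2d, shape_3d)
```

A shape built this way could then be passed to a tensor constructor such as `torch.randn(shape)` to produce an input with no two dimensions of equal size.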
Reviewed By: jbschlosser
Differential Revision: D58509022
fbshipit-source-id: 98e0eacbef6f1cdd09ebc8289582a2eb70118506