Update Cutlass to v3.1 (#94188)
Now that we are on CUDA 11+ exclusively, we can update Nvidia's Cutlass to the next version.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94188
Approved by: https://github.com/ezyang, https://github.com/jansel, https://github.com/malfet