mul(dense, csr), mul(csr, dense) via sparse_mask_csr (#77177)
This adds basic coverage, but can be easily made more efficient by providing a native implementation.
Follow up work includes supporting CSR gradients for strided Tensors.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77177
Approved by: https://github.com/nikitaved, https://github.com/mikaylagawarecki