Add cuSOLVER path for torch.linalg.qr (#56256)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56256
Using cuSOLVER path with `pytest test/test_ops.py -k 'linalg_qr'
--durations=5` cuts the runtime for these tests by 1 minute locally. See https://github.com/pytorch/pytorch/pull/56256#issuecomment-821069086.
Performance comparison: https://github.com/pytorch/pytorch/pull/56256#issuecomment-821077712.
Test Plan: Imported from OSS
Reviewed By: ngimel
Differential Revision: D27960154
Pulled By: mruberry
fbshipit-source-id: 5312330d82337dec2856ec5527156a3a547a0b50