fix gemm call for CUDABlas for THCUNN conv, #23545 (#23552)
Summary:
* Replaced `CUBLAS_OP_N` with the character `'n'` in the gemm call
* added a test
This PR should fix https://github.com/pytorch/pytorch/issues/23545.
Thanks to AlphabetMan for reporting the initial issue in [the forum](https://discuss.pytorch.org/t/cuda-10-1-error-using-transposeconv2d-with-output-padding-1/51414?u=ptrblck), as well as to ngimel for the guidance.
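For context, the bug pattern here is passing the cuBLAS enum `CUBLAS_OP_N` where a `char` transpose flag (`'n'`/`'t'`) is expected. A minimal, self-contained sketch (the `to_cublas` helper and enum values are illustrative stand-ins, not the actual PyTorch code):

```cpp
#include <stdexcept>

// Illustrative stand-in for cuBLAS's operation enum.
enum cublasOperation_t { CUBLAS_OP_N = 0, CUBLAS_OP_T = 1 };

// Hypothetical wrapper in the style of the char-based gemm helpers
// used by THCUNN: it expects the characters 'n'/'t', not the enum.
cublasOperation_t to_cublas(char trans) {
  if (trans == 'n' || trans == 'N') return CUBLAS_OP_N;
  if (trans == 't' || trans == 'T') return CUBLAS_OP_T;
  // Passing the enum CUBLAS_OP_N (value 0) lands here: it arrives
  // as the character '\0', not 'n', so the call is rejected (or,
  // in less defensive code, silently misinterpreted).
  throw std::runtime_error("invalid transpose flag");
}
```

The fix in this PR is the inverse direction of the same mismatch: making the gemm call site hand the argument form its callee actually expects.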
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23552
Differential Revision: D16580986
Pulled By: ezyang
fbshipit-source-id: abc0bce1e84d9c9d96d44ae0296951725adc8424