Refactor cuDNN Convolution memory format and Conv-Bias-Relu code (#65594)
Summary:
This PR makes several changes:
- Changed function `bool cudnn_conv_use_channels_last(...)` to `at::MemoryFormat cudnn_conv_suggest_memory_format(...)`
- Removed `resize_` from the cuDNN convolution code. Added a new overload of `TensorDescriptor::set` that also accepts the desired memory format of the tensor.
- Disabled the use of double + channels_last in cuDNN Conv-Relu and Conv-Bias-Relu. Call `.contiguous(memory_format)` before passing data to cuDNN functions.
- Disabled the use of the fused cuDNN Conv-Bias-Relu for cuDNN versions below 8.0 due to a CUDNN_STATUS_NOT_SUPPORTED error. The native fallback path is used instead.
- Made the Conv-Bias-Relu code respect the global `allow_tf32` flag.
According to the cuDNN documentation, double precision + NHWC is generally not supported.
Closes https://github.com/pytorch/pytorch/pull/66968
Fixes https://github.com/pytorch/pytorch/issues/55301
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65594
Reviewed By: jbschlosser, malfet
Differential Revision: D32175766
Pulled By: ngimel
fbshipit-source-id: 7ba079c9f7c46fc56f8bfef05bad0854acf380d7