[cuDNN][TF32] Threshold adjustments for TF32 on `>=sm80` (#78437)
CC @ptrblck @mcarilli
Could the change to the transformer multilayer test be swapped for an `rtol` change instead? (See also: #75612.)
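
For context on the `rtol` question, here is a rough pure-Python sketch (not code from this PR) of why a relative tolerance may suit TF32 better than a fixed absolute one. The helper names `round_to_tf32` and `is_close` are hypothetical, the tolerance values are illustrative only, and the rounding mode is an assumption; the closeness check uses the same `|actual - expected| <= atol + rtol * |expected|` form as `torch.allclose`.

```python
import struct


def round_to_tf32(x: float) -> float:
    """Emulate TF32 storage: keep 10 of float32's 23 explicit mantissa bits.

    Assumption: round-to-nearest on the magnitude (ties away from zero).
    Only meant for finite, normal inputs.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Add half of the dropped range (2**12), then clear the low 13 bits.
    bits = (bits + 0x1000) & 0xFFFFE000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def is_close(actual: float, expected: float, rtol: float, atol: float) -> bool:
    """Closeness check in the same form as torch.allclose."""
    return abs(actual - expected) <= atol + rtol * abs(expected)


expected = 1000.3
actual = round_to_tf32(expected)  # 1000.5: the TF32 grid spacing here is 0.5

# A fixed absolute threshold fails for a large-magnitude value...
fails_with_atol = not is_close(actual, expected, rtol=0.0, atol=1e-2)
# ...while a relative threshold scales with the value being compared.
passes_with_rtol = is_close(actual, expected, rtol=1e-3, atol=0.0)
```

Since a single TF32 rounding introduces a relative error of at most about 2^-11 (roughly 4.9e-4), a relative threshold tracks the format's precision across magnitudes, whereas an absolute one has to be tuned to the scale of each test's outputs.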
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78437
Approved by: https://github.com/ngimel