pytorch
dd2c22f4 - bsr_dense_bmm(): enable more precise float32 support with float64 accumulators (#100882)

Commit
2 years ago
bsr_dense_bmm(): enable more precise float32 support with float64 accumulators (#100882) Float64 is there in Triton! This PR increases precision for float32 inputs with float64 accumulation dtype. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100882 Approved by: https://github.com/cpuhrsch
Author
Committer
Parents
Loading