pytorch
dd2c22f4 - bsr_dense_bmm(): enable more precise float32 support with float64 accumulators (#100882)

Commit

1 year ago

bsr_dense_bmm(): enable more precise float32 support with float64 accumulators (#100882) Float64 is there in Triton! This PR increases precision for float32 inputs with float64 accumulation dtype. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100882 Approved by: https://github.com/cpuhrsch

Author

nikitaved

Committer

pytorchmergebot

Parents

979f55d3

Files2

test
- test_sparse_csr.py
torch/sparse
- _triton_ops.py

pytorch dd2c22f4 - bsr_dense_bmm(): enable more precise float32 support with float64 accumulators (#100882)

pytorch
dd2c22f4 - bsr_dense_bmm(): enable more precise float32 support with float64 accumulators (#100882)