pytorch
2166fc55 - improve softmax lastdim performance on bfloat16 by adding more fusion

Commit
2 years ago
improve softmax lastdim performance on bfloat16 by adding more fusion Pull Request resolved: https://github.com/pytorch/pytorch/pull/76278 Approved by: https://github.com/frank-wei
Author
Committer
Parents
Loading