onnxruntime
34f77eaa - bfloat16 support for quickgelugrad (#18336)

Commit
2 years ago
bfloat16 support for quickgelugrad (#18336) ### Description <!-- Describe your changes. --> Registers BFloat16 datatype as valid input type for CUDA QuickGeluGrad Kernel. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> Enabling `meta-llama/Llama-2-70b` to be finetuned with ONNX Runtime training. --------- Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
Author
Parents
Loading