onnxruntime
3c5d02a9 - Implement BatchNormGradient kernel for CPU EP (#7622)

Commit

2 years ago

Implement BatchNormGradient kernel for CPU EP (#7622) **Description**: Register an implementation for BatchNormInternal and add a CPU kernel for BatchNormGradient. This is the third in a series of PRs to implement BN training on CPU (first was #6946, second was #7539). **Motivation and Context** Support training networks with BatchNorm (e.g. convnets). Also note that there exists a CUDA kernel for BN (forward training & backwards) but it's currently disabled due to flaky failures; someone more familiar with those parts can register the implementation for BNInternal on CUDA (gradient kernel doesn't have to change). --------- Co-authored-by: Simon Zirui Guo <simonguozirui@berkeley.edu> Co-authored-by: mindest <linminuser@gmail.com> Co-authored-by: mindest <30493312+mindest@users.noreply.github.com>

References

#7622 - Implement BatchNormGradient kernel for CPU EP

Author

pranav-prakash

Parents

5e2f46df

onnxruntime 3c5d02a9 - Implement BatchNormGradient kernel for CPU EP (#7622)

onnxruntime
3c5d02a9 - Implement BatchNormGradient kernel for CPU EP (#7622)