DeepSpeed
f26900ec
- Modify normalize kernels to save variance in floating-point for backward
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Previous Change (CTRL+↑)
Next Change (CTRL+↓)
Expand Context Lines
Collapse Context Lines
Hide Minimap (CTRL+M)
Commit
3 years ago
Modify normalize kernels to save variance in floating-point for backward
References
#1404 - Transformer/fix layer norm
Author
Reza Yazdani
Parents
4017a7c9
Files
6
csrc
includes
custom_cuda_layers.h
ds_transformer_cuda.h
normalize_layer.h
transformer
ds_transformer_cuda.cpp
normalize_kernels.cu
deepspeed/ops/transformer
transformer.py
Loading