onnxruntime
Using standard layernorm cuda kernel for skiplayernorm.
#15076
Merged

yufenglee merged 4 commits into main from zhalei/stable_skip_layernorm
zhanghuanrong Using standard layernorm cuda kernel for skiplayernorm.
d993649a
zhanghuanrong requested a review from hariharans29 3 years ago
zhanghuanrong requested a review from yufenglee 3 years ago
zhanghuanrong better resource calc.
ac105925
hariharans29 commented on 2023-03-16
hariharans29 commented on 2023-03-16
zhanghuanrong requested a review from souptc 3 years ago
zhanghuanrong requested a review from Lafi7e 3 years ago
zhanghuanrong Fix no enough resource issue. Ensure power of 2 on the threads_y.
b8182259
zhanghuanrong Strictly using internal calculation type with gamma and beta.
c6d3e53d
yufenglee approved these changes on 2023-03-23
yufenglee merged 910fc09d into main 3 years ago
yufenglee deleted the zhalei/stable_skip_layernorm branch 3 years ago
