onnxruntime
Using vectorized loads (float2) for fp16 to improve performance
#11390
Merged

Using vectorized loads (float2) for fp16 to improve performance #11390

hubertlu-tw
Using vectorized loads (float2) for fp16 to improve performance
664bb50e
tianleiwu
Fix a few warnings from cpplint
68904e56
Fix a few warnings from cpplint
5dc6cb5c
hariharans29
hariharans29 commented on 2022-04-29
hariharans29
hariharans29 commented on 2022-04-29
hubertlu-tw Use __float2half2_rn and fix some cpplint warnings
64821fb1
tianleiwu
tianleiwu commented on 2022-05-03
Move some computaions to LaunchFastGeluKernel
4e998546
hubertlu-tw
Fix some Lint C++ warning
e8c19264
hariharans29
hariharans29
hariharans29
azure-pipelines
azure-pipelines
tianleiwu
tianleiwu approved these changes on 2022-05-05
tianleiwu
hariharans29 hariharans29 merged 2a90922f into master 3 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone