onnxruntime
Fix perf issue in Conv CUDA kernel
#7348
Merged

SherlockNoMad merged 6 commits into master from bahuang/conv_fix
SherlockNoMad
Fix perf issue in Conv CUDA kernel
caf652cb
SherlockNoMad added core runtime
SherlockNoMad requested a review from duli2012 4 years ago
SherlockNoMad requested a review from weixingzhang 4 years ago
SherlockNoMad requested a review from RandySheriffH 4 years ago
SherlockNoMad requested a review 4 years ago
hariharans29
hariharans29 commented on 2021-04-15
Read available memory from device
9235444d
SherlockNoMad
SherlockNoMad commented on 2021-04-15
SherlockNoMad
SherlockNoMad commented on 2021-04-15
assuming 10% fragmentation
ebc1b735
Merge branch 'master' into bahuang/conv_fix
fc34fc71
fix build
3eb1cf1f
Merge branch 'master' into bahuang/conv_fix
f30aef6e
duli2012
duli2012 commented on 2021-04-16
vadimkantorov
SherlockNoMad
vadimkantorov
duli2012
duli2012 approved these changes on 2021-04-19
SherlockNoMad merged ce7ff27b into master 4 years ago
SherlockNoMad deleted the bahuang/conv_fix branch 4 years ago
vadimkantorov
