onnxruntime
Fix AMD multi-tensor implementation.
#5997
Merged

Fix AMD multi-tensor implementation. #5997

jessebenson merged 3 commits into master from jesseb/amd-multi-tensor
jessebenson
jessebenson Fix AMD multi-tensor implementation.
f450c8a1
jessebenson jessebenson requested a review 5 years ago
jessebenson jessebenson requested a review from weixingzhang weixingzhang 5 years ago
jessebenson jessebenson requested a review from suffiank suffiank 5 years ago
weixingzhang
jessebenson Re-enable Lamb unit tests for AMD
bde18bc2
jessebenson
jessebenson Use __launch_bounds__ workaround, rather than limiting threads to 256…
c5ea2d8b
weixingzhang
weixingzhang approved these changes on 2020-12-02
jessebenson jessebenson merged 14f6eb14 into master 5 years ago
jessebenson jessebenson deleted the jesseb/amd-multi-tensor branch 5 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone