Fix AMD multi-tensor implementation. #5997
Fix AMD multi-tensor implementation.
f450c8a1
Re-enable Lamb unit tests for AMD
bde18bc2
Use __launch_bounds__ workaround, rather than limiting threads to 256…
c5ea2d8b
jessebenson
deleted the jesseb/amd-multi-tensor branch 5 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub