benchmark
[WIP] Use sync-free cuda event timing in benchmark
#999
Open

Loading