[CUDA] Rollback TileMemcpy and TileBatchedMemcpy when Block Size is Small #11187
switch kernel impl by block size
61019b5e
fix win build
9506df09
pengwa
commented
on 2022-04-14
pengwa
dismissed these changes
on 2022-04-14
rename some var
dbb6c7c2
Lafi7e
dismissed their stale review
via dbb6c7c2
4 years ago
pengwa
dismissed these changes
on 2022-04-14
Merge branch 'master' into weicwang/tile_perf
91a36632
fix typo
0dcf692c
Lafi7e
dismissed their stale review
via 0dcf692c
4 years ago
Lafi7e
merged
0bad5b1b
into master 4 years ago
Lafi7e
deleted the weicwang/tile_perf branch 4 years ago