auto-round
fix cuda low_cpu_mem_usage ut
#1010
Merged

Loading