benchmark
Remove warmups in correctness check and run each benchmark only once.
#358
Merged
