DeepSpeed
Use CUDA events for inference model profiling
#2371
Merged

Loading