Reset NVIDIA devices stuck in failed mode (#88459)
Try to reset NVIDIA devices that get stuck in a failed state, as suggested in a comment on https://github.com/pytorch/pytorch/issues/88388
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88459
Approved by: https://github.com/malfet