transformers
4d0ea3d2 - Cuda rng_state_all is used when saving in distributed mode so same should also be used when loading (#23045)

Loading