transformers
Speedup T5 Flax training by using Numpy instead of JAX for batch shuffling
#15963
Merged

Loading