transformers
91fb62d0 - Speedup training by using numpy instead of jnp for batch shuffling (#15963)

Loading