transformers
Optimize by not computing gradients for parameters set to requires_grad=False
#21236
Merged
