optimum
BetterTransformer support training & autocast for all archs
#1225
Merged

Commits
  • support training
    fxmarty committed 2 years ago
  • encoders and encoder+decoder all work
    fxmarty committed 2 years ago
  • warning about training decoders with padding
    fxmarty committed 2 years ago
  • leave to an other PR the backward for some archs
    fxmarty committed 2 years ago
  • nit
    fxmarty committed 2 years ago
  • fix tests
    fxmarty committed 2 years ago
  • hopefully tests pass
    fxmarty committed 2 years ago
  • fix
    fxmarty committed 2 years ago
Loading