transformers
disable use_cache if using gradient checkpointing
#30320
Merged

Commits
  • disable use_cache if using gradient checkpointing
    chenzizhao committed 1 year ago
Loading