transformers
Fix BLT training_ci overfit test
#42685
Merged

Commits
  • Fix BLT training_ci overfit test by disabling cache and adjusting training thresholds
    preetam1407 committed 18 days ago
  • Fix BLT training_ci overfit test by disabling cache and adjusting training thresholds
    preetam1407 committed 18 days ago
  • Fix BLT training_ci overfit test by disabling cache and adjusting training thresholds
    preetam1407 committed 18 days ago
  • Format BLT tests with ruff
    preetam1407 committed 18 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 17 days ago
  • Fix BLT training CI with custom weight initialization and overfit test
    preetam1407 committed 15 days ago
  • Fix BLT training CI with custom weight initialization and overfit test
    preetam1407 committed 15 days ago
  • Fix BLT training CI with custom weight initialization and overfit test
    preetam1407 committed 15 days ago
  • Fix BLT training CI with custom weight initialization and overfit test
    preetam1407 committed 15 days ago
  • Fix BLT training CI with custom weight initialization and overfit test
    preetam1407 committed 15 days ago
  • Fix BLT training CI with custom weight initialization and overfit test
    preetam1407 committed 15 days ago
  • Update BLT init logic and adjust repo checks for non-functional model wrappers
    preetam1407 committed 15 days ago
  • Fix repo/config checks by marking BLT Text/Vision models as placeholders
    preetam1407 committed 14 days ago
  • Fix repo/config checks by marking BLT Text/Vision models as placeholders
    preetam1407 committed 14 days ago
  • Fix repo/config checks by marking BLT Text/Vision models as placeholders
    preetam1407 committed 14 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 14 days ago
  • Document BLT weight initialization sources and restore default overfit thresholds
    preetam1407 committed 14 days ago
  • Align BLT weight init with nn.init
    preetam1407 committed 14 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 13 days ago
  • Fix BLT init weights and remove modular conversion issues
    preetam1407 committed 13 days ago
  • fixes circle ci failures
    preetam1407 committed 12 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 10 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 10 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 10 days ago
  • fix
    preetam1407 committed 10 days ago
  • fix
    preetam1407 committed 10 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 10 days ago
  • Merge branch 'main' into fix-blt-training-ci
    3outeille committed 10 days ago
  • fix recurrent_gemma overfit generation with cache
    preetam1407 committed 10 days ago
  • Fix recurrent_gemma overfit generation with cache
    preetam1407 committed 10 days ago
  • rerun circleci
    preetam1407 committed 10 days ago
  • rerun circleci
    preetam1407 committed 10 days ago
  • Log RecurrentGemma cache exception in training mixin
    preetam1407 committed 10 days ago
  • ci: rerun
    preetam1407 committed 10 days ago
  • ci: rerun
    preetam1407 committed 10 days ago
  • Merge branch 'main' into fix-blt-training-ci
    preetam1407 committed 10 days ago
  • Merge branch 'main' into fix-blt-training-ci
    preetam1407 committed 10 days ago
  • ci: rerun
    preetam1407 committed 10 days ago
  • Merge branch 'main' into fix-blt-training-ci
    preetam1407 committed 10 days ago