transformers
afa43c73 - test ci training for text model only (#42597)

Commit
39 days ago
test ci training for text model only (#42597) * begin test ci training * add better logging * add better logging + training loop * fix sentence + grad_norm assert * create circlci config fort training * fix ci to detect training_ci job * add -s for pytest CI * add generate assert as well * make training ci trigger for every model change instead * set eos_token_id to 0 otherwise it will stop generating too soon * refactor * moving logging and metrics to proper files * update marker in pyproject.toml * linting * linting again * reduce pytest worker * fix deadlock in test * add license * dont show logs during test * loosen threshold a bit * Initialize cache state in RecurrentGemmaRecurrentBlock based on batch… (#42627) * Initialize cache state in RecurrentGemmaRecurrentBlock based on batch size * loosen a bit the threshold * Revert "loosen a bit the threshold" This reverts commit d3d42e1e3b632e62eab016d3660bd1172fb326eb. * skipping BLT for now (until it get fixed)
Author
Parents
Loading