Gradient accumulation for TFTrainer (#9585)
* gradient accumulation for tftrainer
* label naming
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* label naming
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>