Add strategy to store results in evaluation loop (#30267)
* Add evaluation loop container for intermediate results
* Add tests for EvalLoopContainer
* Formatting
* Fix padding_index in test and typo
* Move EvalLoopContainer to trainer_pt_utils to avoid additional imports
* Fix `eval_do_concat_batches` arg description
* Fix EvalLoopContainer import