Output eval logging batch (#961)
* Skip flaky lion8b test (#598)
* relax atol and add retries to reduce flakiness in lion8b timing test
* add eval output logging
* add back tasks
* foo
* add rlhf prompts
* add rlhf prompts
* add rlhf prompts
* add rlhf prompts
* add rlhf prompts
* fix prompt
* fix prompt
* modify mcli
* test
* test
* fix
* fix merge
* wip
* merge
* reset files, wip commit
* rm small changes
* reduce changes
* reduce changes
* .
* wip
* rm batch keys
* revert init device
* linting
* add import
* fix import
* add eval_output_logging to registry
* re-add import
* pyright + linting
---------
Co-authored-by: dblalock <davis@mosaicml.com>
Co-authored-by: Jeremy Dohmann <jeremy@mosaicml.com>