Gauntlet v0.1 (#674)
* update yaml
* adding datasets
* adding datasets
* added agi eval
* test CoT eval
* fix broken eval yaml
* fix broken eval yaml
* debugging
* debugging
* commit
* commit
* commit
* commit
* commit
* restore mcli
* adding simple tasks
* add simple human_eval
* fix yaml
* fix yaml
* remove breakpoint
* remove breakpoint
* change bsz
* merge main
* eval gauntlet cb
* add udpated readme
* fix precommit
* add pii
* restor line
* restor line
* add execution predicrtion
* add execution prediction
* add execution prediction
* change mosaicml reqs
* change mosaicml reqs
* fix error
* comment
* test smaller beams
* tesT
* tesT
* tesT
* add coding task
* tesT
* finish eval
* finish data
* fix
* fix
* remove strategyqa cot
* remove
* remove
* foo
* edit
* fix
* rm breakpoint
* rm breakpoint
* remove execution prediction; make coding optional
* remove execution prediction; make coding optional
* remove import
* remove import
* restore files
* restore
* restore
* update readm; rename gauntlet yamls
* edit yamls
* fix yamllint
* restore mpt eval
---------
Co-authored-by: Michael Carbin <michael.carbin@databricks.com>
Co-authored-by: Daniel King <43149077+dakinggg@users.noreply.github.com>