Add eval loader to eval script (#742)
* Add eval loader to eval script
* small input tests
* updates
* fix typing and formatting
* fixes, add tests
* remove circular dependency
* tests pass
* nits + small fixes
* add metrics at the end, refactor to put icl/gauntlet as helpers
* NOT
* metrics instead of models, add unit tests