Reorganize tests to make them easier to find (#768)
* Add eval loader to eval script
* small input tests
* updates
* fix typing and formatting
* fixes, add tests
* remove circular dependency
* tests pass
* nits + small fixes
* add metrics at the end, refactor to put icl/gauntlet as helpers
* NOT
* metrics instead of models, add unit tests
* Move tests into directories
* add copyright to inits
* fix relative paths
* fixes
* revert gauntlet test change
* Support inputs_embeds (#687)
* support inputs_embeds
* update tests to test inputs_embeds
* make iids optional inputs to fwd
* remove check for both iids and inputs_embeds
in MPTForCausalLM. It is checked in the base model, and it is actually a common practice to pass both during autoregressive generation. Embeds are used first, then once the kvcache is nonempty, iids are used instead
* reorder kwargs
* add more tests
* fix device merge artifact in test_model.oy
* fix generate test
* yapf
* Better error message when test does not complete (#769)
* run script tests first
* comment out
* ascripts -> scripts
* bad dirs
* try this
* hacks
* add a note about a_scripts
---------
Co-authored-by: Sam Havens <sam@mosaicml.com>