Avoid importing models in parent process, instead just iter names
- something in torchbench import was creating a python process and
allocating GPU memory. Possibly a dataloader. This was persisting
in the background while children subprocess then tried to work
- avoid importing torchbench models in the parent process but now need
to handle all checking (NotImplemented, etc.) in child process