Expose runtime errors in the stableness test (#1487)
Summary:
We should treat the stableness test as sort of stress-test. We should fix any runtime errors that exposed in the stableness test.
Test workflow: https://github.com/pytorch/benchmark/actions/runs/4471135231
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1487
Reviewed By: weiwangmeta
Differential Revision: D44221086
Pulled By: xuzhao9
fbshipit-source-id: d6bbf0c981df8117db92614701016d6a4c489798