[NVFuser] multithreading nvfuser test
1) add multithreading tests
2) make IrParser thread safe with std::call_once (previously, registerJitOperator could get called twice simultaneously and segfault)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76259
Approved by: https://github.com/jjsjann123