Add flag for dynamo+ddp optimizations
Add a flag that can be used to turn dynamo+ddp optimizations on. This
will be used to compare how dynamo+ddp performs with and without the
additional graph break strategy for improving dynamo+ddp
compute/communication overlap.
ghstack-source-id: b590e00c8f923469faf0457dffe6338abcce4d7d
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1221