[CI] Use different subdirectories for amp and float32 nightly perf run (#96470)
Summary: runner.py deletes its output_dir as its first step, so we need
to keep two separate subdirectories.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96470
Approved by: https://github.com/huydhn