Apply clang-tidy perf fixes to aten (#91772)
Mostly automated fixes to get rid of implicit copies. I also fixed one clang-tidy NOLINT comment that was in the wrong spot. Split off from #91559.
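
For context, a minimal sketch of the kind of implicit copy that clang-tidy's performance checks (for example `performance-for-range-copy`) typically flag, and the usual fix. This is illustrative only and not taken from the diff; the function names are hypothetical.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical "before": the loop variable is declared by value, so every
// iteration copies a std::string even though it is only read.
std::size_t total_length_copying(const std::vector<std::string>& names) {
  std::size_t total = 0;
  for (auto name : names) {  // implicit copy of each element
    total += name.size();
  }
  return total;
}

// Hypothetical "after": binding by const reference avoids the copy and
// leaves the result unchanged.
std::size_t total_length(const std::vector<std::string>& names) {
  std::size_t total = 0;
  for (const auto& name : names) {
    total += name.size();
  }
  return total;
}
```

Both versions return the same value; only the per-element copy goes away, which is why this class of fix is generally safe to apply mechanically.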
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91772
Approved by: https://github.com/soumith