[ci] clean up some multigpu tests, and add funcol test (#107153)
Add the funcol tests to multigpu tests to ensure it runs on CI
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107153
Approved by: https://github.com/kumpera
ghstack dependencies: #107151, #107152