follow up PR for test_c10d_ucc.py in response to Xiang's review of #88110 (#99654)
* Adds extra test_allgather_base in UccProcessGroupWithDispatchedCollectivesTests; rest of nccl and gloo tests there don't work on ucc
* Adds cpu tests for [op]_work_wait_gpu tests
* Added single tensor input test for allgather_basics; multi tensor input still doesn't seem to be supported by ucc
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99654
Approved by: https://github.com/kwen2501