Fix RuntimeError for moe on XPU: tensors found at least two devices (#5519)
There is following error on XPU while unit testing
"DeepSpeed/tests/unit/moe/test_moe.py"
DeepSpeed/deepspeed/moe/sharded_moe.py line 223, in top1gating
RuntimeError: Expected all tensors to be on the same device, but found
at least two devices, xpu:0 and cpu!
Fix it by device conversion.
---------
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>