Move inf_or_nan_tracker to cpu for cpu offload (#5826)
Must use the same device as grad_partitions_flat_buffer
---------
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>