[mGPU] Use the CPU target config passed from the XLA side via FFI.
Currently, the host triple is detected while the host code is being compiled. This works well when the kernel is compiled and run on the same machine. To support cross-compilation, this change uses the CPU target architecture passed in from XLA directly instead of relying on detection.
PiperOrigin-RevId: 895789138