Pass `with_cuda` arg for jit_load in OpBuilder (#7226)
When `with_cuda` is left at its default of `None`, Torch decides whether
to add CUDA headers and libraries to the build of a JIT C++ extension,
and whether to hipify it, based on whether `sources` contains any `.cu`
or `.cuh` files.
https://github.com/pytorch/pytorch/blob/2a909cab1699e2be26fc7d01c7c2d20c726e1be6/torch/utils/cpp_extension.py#L1623-L1627
Some ops, such as DeepCompile, have no `.cu` or `.cuh` files in their
sources but still need hipify on AMD, because their C++ code includes
several CUDA headers. With the default `None`, hipify is skipped for
these ops. It is therefore better to set `with_cuda` explicitly whenever
the build is not `build_for_cpu`.
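
A minimal sketch of the behavior described above, written without importing
Torch so it runs anywhere. `infer_with_cuda` only imitates the extension-based
default described in this message, `resolve_with_cuda` is a hypothetical helper
name, and the source file names are made up for illustration; none of these are
the actual Torch or DeepSpeed code.

```python
import os

# Extensions that the default detection (as described above) treats as CUDA sources.
CUDA_EXTENSIONS = (".cu", ".cuh")


def infer_with_cuda(sources, with_cuda=None):
    """Imitate the default: None means 'detect from the file extensions'."""
    if with_cuda is None:
        return any(os.path.splitext(path)[1] in CUDA_EXTENSIONS for path in sources)
    return with_cuda


def resolve_with_cuda(build_for_cpu):
    """Hypothetical helper: force CUDA handling unless building for CPU.

    Returning None for CPU builds keeps the extension-based default there.
    """
    return None if build_for_cpu else True


# Hypothetical op whose sources are all .cpp but which includes CUDA headers.
cpp_only_sources = ["deepcompile/init.cpp", "deepcompile/util.cpp"]

# Default detection sees no .cu/.cuh files, so CUDA handling (and hipify) is skipped.
detected = infer_with_cuda(cpp_only_sources)

# Passing the flag explicitly overrides the detection for non-CPU builds.
forced = infer_with_cuda(cpp_only_sources, with_cuda=resolve_with_cuda(build_for_cpu=False))

# CPU builds fall back to the default detection.
cpu_build = infer_with_cuda(cpp_only_sources, with_cuda=resolve_with_cuda(build_for_cpu=True))
```

Here `detected` and `cpu_build` are falsy while `forced` is true, which is the difference this change relies on: the decision no longer depends on the file extensions in `sources` when the op is built for an accelerator.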
Signed-off-by: Hollow Man <hollowman@opensuse.org>