[Inductor] Unify Inductor CUDA & CPUT unit tests input clone function (#99118)
Inductor CUDA unit tests doesn't preserve ```storage_offset``` when cloning input, this PR fixed it by making both CUDA and CPU tests use the same helper function ```clone_preserve_strides```.
This was found by @lantiankaikai when he was working on #98483, but he can't test it due to lack of CUDA env.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99118
Approved by: https://github.com/ngimel