Add ROCm pytest workflow to nightly/release CI

Integrates the pytest_rocm.yml workflow into the main CI pipeline so that it
runs as part of nightly and release testing. The workflow runs the ROCm pytest
suite against ROCm 7.1.1 across multiple GPU configurations (1/4/8 GPUs),
Python versions (3.11, 3.12), and Ubuntu versions (22, 24).
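
For illustration only, the coverage described above could be expressed as a
matrix on a job that calls the reusable workflow. This is a minimal sketch, not
the actual change: the job name, the workflow path under .github/workflows/,
the matrix keys, and the input names are all assumptions, and it assumes
pytest_rocm.yml exposes matching `workflow_call` inputs. Whether the real
pipeline runs the full cross-product (12 jobs) or a subset is not stated here.

```yaml
# Hypothetical caller job in the nightly/release pipeline.
# All names below (job id, matrix keys, inputs) are illustrative assumptions.
jobs:
  pytest-rocm:
    strategy:
      fail-fast: false            # let the remaining combinations finish if one fails
      matrix:
        gpu-count: [1, 4, 8]
        python-version: ["3.11", "3.12"]
        ubuntu-version: ["22", "24"]
    # Assumed location of the reusable workflow named in this change.
    uses: ./.github/workflows/pytest_rocm.yml
    with:
      gpu-count: ${{ matrix.gpu-count }}
      python-version: ${{ matrix.python-version }}
      ubuntu-version: ${{ matrix.ubuntu-version }}
      rocm-version: "7.1.1"
```

Quoting the Python and Ubuntu versions keeps YAML from parsing them as numbers,
which matters for values such as 3.10 in future matrices.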