Fill CUDA EP opset gap for Sin and Cos operators (7→22) (#27756)
- [x] Cap existing opset 7 Sin/Cos kernels to versioned 7-21
- [x] Add new opset 22 Sin/Cos kernels with BFloat16 support (HFDX)
- [x] Update forward declarations and BuildKernelCreateInfo entries
- [x] Add opset 22 tests for Sin and Cos
- [x] Rebase onto latest main, resolve conflicts
- [x] Merge commit with latest main to resolve conflicts
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>
Co-authored-by: justinchuby <11205048+justinchuby@users.noreply.github.com>
Co-authored-by: Tianlei Wu <tlwu@microsoft.com>