Extend CMAKE_CUDA_FLAGS with all Blackwell compute capacity (#23928)
### Description
<!-- Describe your changes. -->
* Update range to build SASS on all arch and PTX on highest arch
* when cuda>=12.8, build all arch (including latest blackwell)
### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->
https://cmake.org/cmake/help/latest/prop_tgt/CUDA_ARCHITECTURES.html
https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html#gpu-feature-list