Fix a potential race in the CUDA TopK kernel (#19917)
### Description
When `K` is supplied as an input tensor, the kernel writes it to a
mutable member of the `TopK` class and then computes from that member.
With concurrent `Run()` calls that pass different `K` values, this is
likely to cause a data race.
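
The pattern can be illustrated with a minimal CPU-only sketch. `TopKSketch`, `ComputeRacy`, `ComputeSafe`, and `LargestK` are hypothetical names made up for this example; they are not the actual kernel code, and keeping `K` in a local variable is one common way to remove such a race, not necessarily the exact change made here.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical stand-in for the kernel class; not the actual ONNX Runtime code.
class TopKSketch {
 public:
  // attr_k >= 0: K comes from an attribute. attr_k < 0: K comes from the input.
  explicit TopKSketch(int64_t attr_k = -1) : attr_k_(attr_k) {}

  // Racy pattern: a const method writes shared mutable state, then reads it back.
  std::vector<float> ComputeRacy(const std::vector<float>& x, int64_t k_input) const {
    k_ = k_input;            // another thread may overwrite k_ right here
    return LargestK(x, k_);  // ...so this read may see a different call's K
  }

  // Race-free pattern: K lives in a local, so each call only sees its own value.
  std::vector<float> ComputeSafe(const std::vector<float>& x, int64_t k_input) const {
    const int64_t k = (attr_k_ >= 0) ? attr_k_ : k_input;
    return LargestK(x, k);
  }

 private:
  // Returns the k largest values in descending order (takes x by value on purpose).
  static std::vector<float> LargestK(std::vector<float> x, int64_t k) {
    const auto n = static_cast<int64_t>(x.size());
    k = std::clamp<int64_t>(k, 0, n);
    std::partial_sort(x.begin(), x.begin() + k, x.end(), std::greater<float>());
    x.resize(static_cast<std::size_t>(k));
    return x;
  }

  int64_t attr_k_;
  mutable int64_t k_ = 0;  // shared across concurrent calls in the racy variant
};
```

In this sketch, two threads sharing one `TopKSketch` and calling `ComputeSafe` with different `K` values each get a result of the size they asked for, whereas with `ComputeRacy` either call may pick up the other's `K`.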
### Motivation and Context
Fixes a potential data race in the CUDA TopK kernel when `K` is passed
as a tensor.