[webgpu] Support int64 for range (#26673)
### Description
- Add new registerInt64Ops option to WebGpuExecutionProviderConfig
- Int64 support now enabled when enable_graph_capture OR
register_int64_ops is true
- Refactor Range kernel registration to support conditional int64
registration
- Update kernel registry caching to handle all 4 combinations of flags
- Rename parameters from enable_graph_capture to enable_int64 for
clarity
- Add config parsing in webgpu_provider_factory.cc for registerInt64Ops
option
### Motivation
Needed by updating position id with an onnx model in genai.
Continuous decoding mode: `position_ids[i] = i + total_length -
new_kv_length`
We can use an onnx model which includes a Range op to implement update
the position ids:
Inputs: start (total_length - new_kv_length), limit (total_length),
delta (1)
Output: position_ids (1D tensor of size new_kv_length)