[webgpu] Use workgroup_idx instead of workgroup_id.x (#23696)
We should always use workgroup_idx instead of workgroup_id.x in cause
the dispatched workgroups are normalized.
When the input is large enough, the 1d workgroups will be normalized to
2d/3d and results incorrect result.