Fix CUDA index out of bounds for q_idx in VLM token type masking for Gemma3, PaliGemma, and the modular example (#41757)
* Fix CUDA index out of bounds for q_idx in Gemma3 token type masking
* Fix CUDA index out of bounds for q_idx in modular modeling_new_task_model
* Revert "Fix CUDA index out of bounds for q_idx in Gemma3 token type masking"
  This reverts commit f8e5c2a42c305aebd00c46161bf22f520009c8fc.
* Fix CUDA index out of bounds for q_idx in PaliGemma token type masking
* Fix CUDA index out of bounds for q_idx in Gemma3 token type masking