[Bugfix] fix video inference of qwen3vl and qwen3.5 series (#44474)
* fix video inference of qwen3vl and qwen3.5 series
* fix ci
* get_rope_index inhert from qwen2vl
* add test for qwen3vl,qwen3.5
* fix ci
* use assertListEqual instead
* update CI result