Prepare and keep track of position ids in `generate` (#43734)
* draft something for now
* qwens
* partially fix assisted decoding, get back to it later
* fix assisted decoding same model
* pad tokens should have a position id of `0`, not `1`
* qwens fix generate
* other 3d rope models
* glm image
* colqwen
* fix repo messed up with args, fix
* fix repo
* fix qwen positions
* fix?
* make sure the test uses left padded input!
* nit
* update qwen3-5
* fix slow tests, ernie will fix its padding side in hub config
* fix repo
* ernie!
* qwen2-vl
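
To illustrate the pad-token and left-padding bullets above, here is a minimal pure-Python sketch of deriving position ids from an attention mask. The helper name `position_ids_from_mask` is hypothetical and is not the function `generate` uses; it only assumes the common convention that real tokens count up from `0` and that pad positions are filled with `0`.

```python
def position_ids_from_mask(attention_mask):
    """Hypothetical helper: per-row running count of real tokens, pads get 0."""
    position_ids = []
    for row in attention_mask:
        seen = 0  # number of real (non-pad) tokens seen so far in this row
        ids = []
        for is_real in row:
            if is_real:
                ids.append(seen)
                seen += 1
            else:
                ids.append(0)  # pad tokens get position id 0, not 1
        position_ids.append(ids)
    return position_ids


# Left-padded batch: the first row has two pad tokens in front.
batch_mask = [
    [0, 0, 1, 1, 1],
    [1, 1, 1, 1, 1],
]
print(position_ids_from_mask(batch_mask))
# [[0, 0, 0, 1, 2], [0, 1, 2, 3, 4]]
```

With left padding, the last column always holds each row's latest real position, so under this assumption the next decoding step can continue from that value plus one.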