Chunked Prefill VLM (#3188)
* add logic
* working
* add encoder cache free
* fixes
* fix idefics
* update pixel_values
* add improvements
* add improvements
* improve
* nit
* fix inputs_embeds
* nit
* optimizations
* add prometheus port
* rename vars
* rename vars
* nit
* disable chunking for qwen
* review comments
* remove port
* improve headdim
* remove kwargs and redundant args
* fix qwen2_5
* fix config image_token_id error
* fix test
* update paligemma
* fix paligemma text
* minor fix
* fix qwen test
* fix qwen test