llama.cpp
ec2b787e - mtmd: Add dynamic high-resolution image preprocessing for InternVL model (#20847)

Commit
4 days ago
mtmd: Add dynamic high-resolution image preprocessing for InternVL model (#20847) * added support for internvl's dynamic high-resolution (Qianfan-OCR needed) * add min/max dynamic patch to gguf meta * clean up * simplified handling min/max dynamic patch * reuse llava_uhd logic for slice images * provide default values for older models * flake8 * prevent writing 0 value to gguf * remove duplicated resolution candidates with a better algorithm * fix indentation * format * add protection from divide by zero * change to 0 to be safe --------- Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
Author
Parents
Loading