text-generation-inference
fca2218f - Fix runtime error when Qwen2-VL was prompted with multiple images

Commit

1 year ago

Fix runtime error when Qwen2-VL was prompted with multiple images Fix runtime error when Qwen2-VL model is prompted with prompt with more than one image. The runtime error was: File "text-generation-inference/server/text_generation_server/models/custom_modeling/qwen2_vl.py", line 459, in get_position_ids text_pos_ids = torch.arange(text_length, device=d) RuntimeError: upper bound and larger bound inconsistent with step sign The error was caused by text_length variable going to negative value when multiple images caused multiple loops in the get_position_ids function's main loop. The error is a simple logic mistake where next_image_pos is initialized as relative offset from current_pos, but was used like it was absolute position from zero.

Author

alatja

Committer

drbh

Parents

a72f339c

text-generation-inference fca2218f - Fix runtime error when Qwen2-VL was prompted with multiple images

text-generation-inference
fca2218f - Fix runtime error when Qwen2-VL was prompted with multiple images