transformers
f690a2a1 - [video processors] decode only sampled videos -> less RAM and faster processing (#39600)

Commit

175 days ago

[video processors] decode only sampled videos -> less RAM and faster processing (#39600) * draft update two models for now * batch update all VLMs first * update some more image processors * update * fix a few tests * just make CI green for now * fix copies * update once more * update * unskip the test * fix these two * fix torchcodec audio loading * maybe * yay, i fixed torchcodec installation and now can actually test it * fix copies deepseek * make sure the metadata is returrned when users request it * add docs * update * fixup * Update src/transformers/audio_utils.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * Update src/transformers/models/glm4v/video_processing_glm4v.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * update * what if we set some metadata attr to `None` * fix CI * fix one test * fix 4 channel test * fix glm timestemps * rebase gone wrong * raise warning once * fixup * typo * fix copies * ifx smolvlm test * this is why torch's official benchmark was faster, set threads to `0` * Apply style fixes --------- Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

References

#39600 - [video processors] decode only sampled videos -> less RAM and faster processing

Author

zucchini-nlp

Parents

64ae6e6b

transformers f690a2a1 - [video processors] decode only sampled videos -> less RAM and faster processing (#39600)

transformers
f690a2a1 - [video processors] decode only sampled videos -> less RAM and faster processing (#39600)