transformers
Extract dynamic vision/audio tensors into standalone pure functions
#45396
Merged

Extract dynamic vision/audio tensors into standalone pure functions #45396

IlyasMoutawwakil merged 86 commits into main from hf-vision-audio-utils
IlyasMoutawwakil
IlyasMoutawwakil Extract pure vision/audio functions into standalone utilities
05d2d21f
IlyasMoutawwakil Fix stale compute_ docstring references to match actual function names
fe46ba2e
IlyasMoutawwakil Revert mlcd changes — not part of this PR
84439a04
IlyasMoutawwakil fix
e62aa984
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
cbc1e22a
HuggingFaceDocBuilderDev
IlyasMoutawwakil kwargs
c1d7a8a5
IlyasMoutawwakil opt-in
27717996
IlyasMoutawwakil fix dtype
fa224e20
IlyasMoutawwakil style
ac2895d1
IlyasMoutawwakil guard torch import
2f2787c9
IlyasMoutawwakil standarize
d628d966
IlyasMoutawwakil propagate inputs
2a014a4a
IlyasMoutawwakil fix docs
957372a1
IlyasMoutawwakil fix docs
4194ff1f
IlyasMoutawwakil auto docs
836424bf
IlyasMoutawwakil more docs fixing
11f73fd4
IlyasMoutawwakil fix omni
71f90eca
IlyasMoutawwakil fix paddle
a89d4369
IlyasMoutawwakil revert paddle ocr until another time
c0fdc0da
IlyasMoutawwakil finally fixed paddle ocr
d1da0229
IlyasMoutawwakil IlyasMoutawwakil requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 65 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-04-13
IlyasMoutawwakil fix review
448ff2ee
IlyasMoutawwakil revert chunking
6731028e
IlyasMoutawwakil IlyasMoutawwakil requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 65 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-04-13
IlyasMoutawwakil Potential fix for pull request finding
693ba9cf
IlyasMoutawwakil Potential fix for pull request finding
d701016f
IlyasMoutawwakil fix torch compilable check
5472c4f3
IlyasMoutawwakil Merge branch 'hf-vision-audio-utils' of https://github.com/huggingfac…
12a416c1
IlyasMoutawwakil fix docs
4e7739bf
IlyasMoutawwakil correct func name
47fed92c
IlyasMoutawwakil fix omni
18a17889
IlyasMoutawwakil IlyasMoutawwakil marked this pull request as ready for review 65 days ago
IlyasMoutawwakil fix video llama 3
4c6e1dfe
zucchini-nlp
zucchini-nlp commented on 2026-04-13
IlyasMoutawwakil fix video llama 3
247b445a
IlyasMoutawwakil
github-actions
github-actions
IlyasMoutawwakil requires torch
3c5e9a8a
IlyasMoutawwakil add missing grid device
27677eda
IlyasMoutawwakil keep rot emb in fp32
45a03e42
IlyasMoutawwakil fix test device
5f3d2ae6
IlyasMoutawwakil
github-actions
github-actions
ArthurZucker
ArthurZucker commented on 2026-04-14
IlyasMoutawwakil fix flm4v flex attention test
1feb220a
IlyasMoutawwakil rename to vision utils
e4c41380
IlyasMoutawwakil only one get_rotary_pos_ids is needed
d401a332
IlyasMoutawwakil style
fc49a3f9
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
fc4bf669
IlyasMoutawwakil style
4711af6a
vasqu
vasqu commented on 2026-04-16
zucchini-nlp
vasqu
IlyasMoutawwakil
IlyasMoutawwakil deprecate only
e85551e0
IlyasMoutawwakil fix
531f13cc
IlyasMoutawwakil simplify and revert processor changes
4c3d84d6
IlyasMoutawwakil
IlyasMoutawwakil renames
9ea6203d
IlyasMoutawwakil move some stuff to their original place
67b09067
IlyasMoutawwakil style
b8323fb8
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
205e94d0
IlyasMoutawwakil style
a6b071f0
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
5dcc3ea8
IlyasMoutawwakil
github-actions
zucchini-nlp
zucchini-nlp commented on 2026-04-22
github-actions
IlyasMoutawwakil use chunked attention
e9ac058e
IlyasMoutawwakil use decorator
a7c22775
IlyasMoutawwakil IlyasMoutawwakil force pushed from 86e00129 to a7c22775 50 days ago
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
85556ab3
IlyasMoutawwakil pass kwargs and return_dict
6d33d4a1
IlyasMoutawwakil fix missing
fe3bcc40
IlyasMoutawwakil keep in and get from kwargs
51f7e206
IlyasMoutawwakil revert some trailing commas
4838e179
IlyasMoutawwakil fix
4246a72d
IlyasMoutawwakil IlyasMoutawwakil requested a review from vasqu vasqu 50 days ago
IlyasMoutawwakil IlyasMoutawwakil requested a review from zucchini-nlp zucchini-nlp 50 days ago
IlyasMoutawwakil
github-actions
github-actions
IlyasMoutawwakil fixes
62d901c6
IlyasMoutawwakil
github-actions
github-actions
IlyasMoutawwakil video llama fixes
1481de69
IlyasMoutawwakil fix qwen3 vl
3538f47d
IlyasMoutawwakil
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
2abfe3a5
IlyasMoutawwakil forgot glm ocr
e44ec7b9
IlyasMoutawwakil
github-actions
github-actions
vasqu
vasqu approved these changes on 2026-04-30
zucchini-nlp
zucchini-nlp commented on 2026-05-01
IlyasMoutawwakil address comments
c5346370
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
43363815
IlyasMoutawwakil drop unnecessary
6a2808b4
IlyasMoutawwakil use correct flash attn check
48f0332f
IlyasMoutawwakil missed deprecation
2c448454
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
f4f830bf
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
df321cda
IlyasMoutawwakil empty commit 1
1d67c0ac
IlyasMoutawwakil empty commit 2
b2f75bc7
IlyasMoutawwakil Merge branch 'hf-vision-audio-utils' of https://github.com/huggingfac…
bff78191
IlyasMoutawwakil revert video llama 3 config changes
7c39239e
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
a3fb74e0
IlyasMoutawwakil style
b7b18e86
IlyasMoutawwakil style fix
4556de45
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
891dcac3
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
78142ddb
vasqu
vasqu approved these changes on 2026-05-06
IlyasMoutawwakil address comments
d1f63d2a
IlyasMoutawwakil remove unnecessary
34382955
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
d4b3e1a0
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
03decc38
IlyasMoutawwakil revert TransformersKwargs and add a todo
1f7a9c37
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
a4565acb
ebezzam
ebezzam commented on 2026-05-12
IlyasMoutawwakil
IlyasMoutawwakil Merge branch 'main' into hf-vision-audio-utils
595b6f16
github-actions
IlyasMoutawwakil IlyasMoutawwakil merged f00940ef into main 35 days ago
IlyasMoutawwakil IlyasMoutawwakil deleted the hf-vision-audio-utils branch 35 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone