Extract dynamic vision/audio tensors into standalone pure functions #45396
Extract pure vision/audio functions into standalone utilities
05d2d21f
Fix stale compute_ docstring references to match actual function names
fe46ba2e
Revert mlcd changes — not part of this PR
84439a04
fix
e62aa984
Merge branch 'main' into hf-vision-audio-utils
cbc1e22a
kwargs
c1d7a8a5
opt-in
27717996
fix dtype
fa224e20
style
ac2895d1
guard torch import
2f2787c9
standarize
d628d966
propagate inputs
2a014a4a
fix docs
957372a1
fix docs
4194ff1f
auto docs
836424bf
more docs fixing
11f73fd4
fix omni
71f90eca
fix paddle
a89d4369
revert paddle ocr until another time
c0fdc0da
finally fixed paddle ocr
d1da0229
fix review
448ff2ee
revert chunking
6731028e
Potential fix for pull request finding
693ba9cf
Potential fix for pull request finding
d701016f
fix torch compilable check
5472c4f3
Merge branch 'hf-vision-audio-utils' of https://github.com/huggingfac…
12a416c1
fix docs
4e7739bf
correct func name
47fed92c
fix omni
18a17889
fix video llama 3
4c6e1dfe
fix video llama 3
247b445a
requires torch
3c5e9a8a
add missing grid device
27677eda
keep rot emb in fp32
45a03e42
fix test device
5f3d2ae6
fix flm4v flex attention test
1feb220a
rename to vision utils
e4c41380
only one get_rotary_pos_ids is needed
d401a332
style
fc49a3f9
Merge branch 'main' into hf-vision-audio-utils
fc4bf669
style
4711af6a
vasqu
commented
on 2026-04-16
deprecate only
e85551e0
fix
531f13cc
simplify and revert processor changes
4c3d84d6
renames
9ea6203d
move some stuff to their original place
67b09067
style
b8323fb8
Merge branch 'main' into hf-vision-audio-utils
205e94d0
style
a6b071f0
Merge branch 'main' into hf-vision-audio-utils
5dcc3ea8
use chunked attention
e9ac058e
use decorator
a7c22775
Merge branch 'main' into hf-vision-audio-utils
85556ab3
pass kwargs and return_dict
6d33d4a1
fix missing
fe3bcc40
keep in and get from kwargs
51f7e206
revert some trailing commas
4838e179
fix
4246a72d
fixes
62d901c6
video llama fixes
1481de69
fix qwen3 vl
3538f47d
Merge branch 'main' into hf-vision-audio-utils
2abfe3a5
forgot glm ocr
e44ec7b9
vasqu
approved these changes
on 2026-04-30
address comments
c5346370
Merge branch 'main' into hf-vision-audio-utils
43363815
drop unnecessary
6a2808b4
use correct flash attn check
48f0332f
missed deprecation
2c448454
Merge branch 'main' into hf-vision-audio-utils
f4f830bf
Merge branch 'main' into hf-vision-audio-utils
df321cda
empty commit 1
1d67c0ac
empty commit 2
b2f75bc7
Merge branch 'hf-vision-audio-utils' of https://github.com/huggingfac…
bff78191
revert video llama 3 config changes
7c39239e
Merge branch 'main' into hf-vision-audio-utils
a3fb74e0
style
b7b18e86
style fix
4556de45
Merge branch 'main' into hf-vision-audio-utils
891dcac3
Merge branch 'main' into hf-vision-audio-utils
78142ddb
vasqu
approved these changes
on 2026-05-06
address comments
d1f63d2a
remove unnecessary
34382955
Merge branch 'main' into hf-vision-audio-utils
d4b3e1a0
Merge branch 'main' into hf-vision-audio-utils
03decc38
revert TransformersKwargs and add a todo
1f7a9c37
Merge branch 'main' into hf-vision-audio-utils
a4565acb
Merge branch 'main' into hf-vision-audio-utils
595b6f16
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub