transformers
Add Phi4 multimodal
#36939
Merged

Add Phi4 multimodal #36939

Cyrilvallez merged 113 commits into main from phi4
Cyrilvallez
github-actions github-actions marked this pull request as draft 1 year ago
github-actions
Cyrilvallez Cyrilvallez marked this pull request as ready for review 1 year ago
github-actions github-actions requested a review from ArthurZucker ArthurZucker 1 year ago
github-actions github-actions requested a review from Rocketknight1 Rocketknight1 1 year ago
ArthurZucker
ArthurZucker approved these changes on 2025-03-24
Cyrilvallez raw start
00bcfd47
Cyrilvallez update
aef5f66f
Cyrilvallez update
60595b32
Cyrilvallez add to imports
ddfe10a6
Cyrilvallez update
88f473eb
Cyrilvallez up
5012749a
Cyrilvallez simplify configs
bc1d1975
Cyrilvallez clean configs
e56e7b0d
Cyrilvallez style
8d35ac95
Cyrilvallez typos
f4904822
Cyrilvallez Update convert_phi4_multimodal_weights_to_hf.py
c0e1da4e
Cyrilvallez Update convert_phi4_multimodal_weights_to_hf.py
c435c22e
Cyrilvallez fix
98b393c1
Cyrilvallez up
0bd29a3c
Cyrilvallez up
52bf0e8c
Cyrilvallez up
a37b0845
Cyrilvallez Update convert_phi4_multimodal_weights_to_hf.py
ce4735b3
Cyrilvallez Update convert_phi4_multimodal_weights_to_hf.py
fe9fed1b
Cyrilvallez up
5fffe534
Cyrilvallez up
c102b462
Cyrilvallez up
dbbad21b
Cyrilvallez Update feature_extraction_phi4_multimodal.py
67cad7f4
Cyrilvallez up
cc4cd0e8
Cyrilvallez up
da8b0aa7
Cyrilvallez up
d41fde15
Cyrilvallez up
f78aec53
Cyrilvallez up
18c33dea
Cyrilvallez simplify configs
abd15e41
Cyrilvallez typo
438ee1a8
Cyrilvallez cut code
01f68a09
Cyrilvallez typo
d942e26a
Cyrilvallez typo
0c1d0820
Cyrilvallez typo
e9910cc7
Cyrilvallez re
28c4d400
Cyrilvallez typo
1c744f07
Cyrilvallez up
2659e69f
Cyrilvallez up
d42a60e4
Cyrilvallez up
9a0c3742
Cyrilvallez add tests
7b83b9e7
Cyrilvallez fix
23bbdd5d
Cyrilvallez fix
7598e618
Cyrilvallez Update test_modeling_phi4_multimodal.py
a52acd41
Cyrilvallez up
42c9ca5f
Cyrilvallez Update test_modeling_phi4_multimodal.py
c35fdc06
Cyrilvallez doc
3e507288
Cyrilvallez fix
6638418e
Cyrilvallez up
09ea6b74
Cyrilvallez up
41fc578a
Cyrilvallez up
e1091098
Cyrilvallez up
0f4f425e
Cyrilvallez up
ca1d04b4
Cyrilvallez up
6377c06e
Cyrilvallez simplify
cb829f39
Cyrilvallez up
046cd482
Cyrilvallez simplify
3cbd8dd7
Cyrilvallez config docstrings
07f58277
Cyrilvallez cleanup
28b29e9c
Cyrilvallez clean
719f204e
Cyrilvallez typo
5dd09faf
Cyrilvallez typo
741bfdc0
Cyrilvallez fix
4f909ab5
Cyrilvallez Update phi4_multimodal.md
406ed4c6
Cyrilvallez fix
2de1ac38
Cyrilvallez fix
65bdb474
Cyrilvallez Update test_modeling_phi4_multimodal.py
a20dfebb
Cyrilvallez update
6acd428e
Cyrilvallez simplify reshapes and permutes
249d1fbf
Cyrilvallez up
1da7e9be
Cyrilvallez simplify special tokens
5bcc6c81
Cyrilvallez simplify processor a lot
014c89e7
Cyrilvallez Update processing_phi4_multimodal.py
4bf35b97
Cyrilvallez Update processing_phi4_multimodal.py
d06a8085
Cyrilvallez switch to fast processor
f754f106
Cyrilvallez image processor
1d18749f
Cyrilvallez Update image_processing_phi4_multimodal_fast.py
01778938
Cyrilvallez add lora extraction to converter
1bbb2987
Cyrilvallez Update convert_phi4_multimodal_weights_to_hf.py
80d0e837
Cyrilvallez Update __init__.py
923b2d09
eustlb add AudioInput type in audio_utils
136d45aa
eustlb rewrite feature_extraction: support torch batched FFT
fcd909bc
eustlb input_audio_embeds -> audio_input_features, input_image_embeds -> ima…
11444c7f
eustlb test update
44c72966
eustlb not mono channel warning update
33c61fdd
Cyrilvallez remove auto maps from processor
1a1e0241
Cyrilvallez kargs dispatch in processor
e28dbb04
Cyrilvallez simplify kwargs dispatch
07c21537
Cyrilvallez simplify merging
6334dd7c
Cyrilvallez remove default sampling rate
4aa8086a
Cyrilvallez style
93323fc6
Cyrilvallez Update test_modeling_phi4_multimodal.py
95e5597d
Cyrilvallez update doc
37b3dbe6
Cyrilvallez doc
bc6d6a58
Cyrilvallez torch only feature extractor
b2413772
Cyrilvallez make fake tokens adjustable
9c752b2d
Cyrilvallez up
e1091098
Cyrilvallez up
ca1d04b4
Cyrilvallez simplify
cb829f39
Cyrilvallez Update test_modeling_phi4_multimodal.py
a20dfebb
Cyrilvallez Update processing_phi4_multimodal.py
d06a8085
Cyrilvallez switch to fast processor
f754f106
eustlb rewrite feature_extraction: support torch batched FFT
fcd909bc
Cyrilvallez kargs dispatch in processor
e28dbb04
Cyrilvallez doc
bc6d6a58
Cyrilvallez torch only feature extractor
b2413772
Cyrilvallez fix
d9beef2a
Cyrilvallez Update processing_phi4_multimodal.py
17985f90
Cyrilvallez simplify mask
c169f367
Cyrilvallez last touch
067edbfa
Cyrilvallez fix copies
9bee9f3e
Cyrilvallez Cyrilvallez force pushed from 1ed11d6f to 9bee9f3e 1 year ago
Cyrilvallez style
653b8ece
Cyrilvallez Update audio_utils.py
4213e97c
Cyrilvallez style
24390037
Cyrilvallez Update feature_extraction_phi4_multimodal.py
16f5ca84
Cyrilvallez Update __init__.py
5b773c8b
Cyrilvallez docstrings
a70f307e
Cyrilvallez copies
ac699b18
Cyrilvallez fix all checks
aa6664bf
Cyrilvallez back to fix-copies
c3a1a898
HuggingFaceDocBuilderDev
Cyrilvallez trigger CIs
095bb8a9
Cyrilvallez Update feature_extraction_phi4_multimodal.py
bdc8e386
Cyrilvallez improve tests with multimodal inputs
4f521955
Cyrilvallez trigger CIs
ec726d7d
Cyrilvallez Cyrilvallez merged 4303d88c into main 1 year ago
Cyrilvallez Cyrilvallez deleted the phi4 branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone