Add CSM model #36719

eustlb merged 163 commits into huggingface:main from eustlb:add-csm
eustlb
eustlb draft structure
48d26477
github-actions marked this pull request as draft 1 year ago
github-actions
eustlb added the New model label
eustlb added the Audio label
eustlb depth decoder with forward pre hook
61db81f6
eustlb full model forward draft
7bfe4b78
eustlb draft update
b81db076
eustlb depth decoder update
a305fa30
eustlb ConversationalSpeechModelForCausalLM updates
36f76329
eustlb add generate
6c4d2d56
eustlb max length criteria small fix
3f3625c9
eustlb update
67b6cc47
eustlb updates
89b2e95c
eustlb generation update
f9c19f99
eustlb update in loss compute
1fa74651
eustlb conversion script
2aabe94c
eustlb update for correct input embeddings
f0a72120
eustlb handle interleaved rope
eb712ded
eustlb update
bb39449f
eustlb update
34dc04df
eustlb update
602f6d79
eustlb support compile
63c160d3
eustlb update training
936282bb
eustlb add doc
1e92e69a
eustlb update doc
abee0f7f
eustlb correct inits
4269b061
HuggingFaceDocBuilderDev
SeungyounShin
eustlb
eustlb marked this pull request as ready for review 344 days ago
github-actions requested a review from ArthurZucker 344 days ago
github-actions requested a review from Rocketknight1 344 days ago
eustlb Merge branch 'main' into add-csm
a56f9afd
SeungyounShin commented on 2025-04-06
eustlb ConversationalSpeechModel -> Csm
0774d720
eustlb conf update
c0bc80aa
eustlb name update
e32d1eb2
eustlb tests CsmForCausalLMTest
05fa1c42
eustlb convert use cached_file
3dc02241
eustlb conf + modeling updates
cc19ee96
eustlb generate utils handle third dim shape
541128d6
eustlb integration test
02f61000
eustlb modeling + conf updates
24094a2c
eustlb common test handle more than 2 dims
35757b35
eustlb add nested audio list utils
3b7bd2f5
eustlb processing handle nested audio list
8473820c
eustlb csm processing draft
26024906
eustlb mimi util
b5f83988
eustlb init updates
2a94bbca
eustlb modular update
99ecb455
eustlb convert modular
f2cfc398
eustlb processing update
52c0d643
eustlb csm tests update
c1eed72f
eustlb generate tests handle third dim
ab83cd0d
eustlb generate utils handle third dim
ac2373f6
eustlb propagate _get_initial_cache_position update
8dcafaa8
eustlb tied_weight_keys update + convert correctly
3d99d28a
eustlb fix inputs_embeds
a4fb4014
eustlb revert audio nested list
198f4a54
thomasgauthier
eustlb batch inference update + return audio
f3a21a5f
eustlb audio_utils update
cb072790
eustlb processor update
ede5e9c0
eustlb some more integration tests
09b0c6dd
eustlb
eustlb remove old test
23a7d38e
eustlb processing output labels
481c2562
eustlb improve
0fc9078a
eustlb fix
39a23ba0
eustlb update rope values with equivalent ones
b637cca2
eustlb conversion update
8d860e5a
eustlb update tests
fcb64e60
eustlb handle depth decoder generation config
03ca2f23
eustlb remove default eos_token_id
d6b8ba65
eustlb make style
7477ea02
eustlb revert modeling_mimi
a4b628ee
eustlb add default generation_config
32f34fc4
eustlb remove sdpa since handled by default
d87b30b4
eustlb make
dc486bc4
eustlb force pushed from 22c9c151 to dc486bc4 324 days ago
eustlb fix conflict
a7724a83
eustlb fix conflicts
856a3e70
eustlb Merge branch 'main' into add-csm
a2423817
eustlb correct naming
380bc4c0
eustlb correct imports
c9cef41d
eustlb make
c6287cb6
eustlb causal -> conditional naming
45176029
eustlb causal -> conditional naming
d9d1cc87
eustlb Merge branch 'main' into add-csm
91d4871a
eustlb auto update
bca3d403
eustlb make
312f57c5
eustlb make
d1dba6f5
eustlb Merge branch 'main' into add-csm
2151964b
eustlb add doc
cbda8891
eustlb Merge branch 'main' into add-csm
44b4f442
eustlb test update
6d9cca31
eustlb fix weight init
9ec03e43
eustlb audio tokens offsets as buffer
8f7f0203
eustlb 4d mask in conditional class
0ce70595
eustlb make
35d9901f
eustlb Merge branch 'main' into add-csm
6961e39d
eustlb Merge branch 'main' into add-csm
c5d9a7d5
patcon
eustlb Merge branch 'main' into add-csm
8130c4e5
eustlb doc update
a68cd027
eustlb fix causal mask
b4831ebc
eustlb fix causal mask
616c12d7
eustlb doc update
a0d2d7ed
eustlb doc update
34f2373b
eustlb Merge branch 'main' into add-csm
7919e9f9
eustlb
eustlb add processor doc
bf2aa574
eustlb update doc
eb8a48d7
eustlb fix 4d causal mask
c046ca58
eustlb update make_list_of_audio
0e664849
eustlb commented on 2025-04-25
eustlb Merge branch 'main' into add-csm
0e2b49bb
ArthurZucker requested a review from copilot-pull-request-reviewer 319 days ago
ArthurZucker
copilot-pull-request-reviewer commented on 2025-04-29
gante
ArthurZucker commented on 2025-04-29
eustlb do not default to mutable
bf5adc4a
eustlb remove duplicates
1e1eccb5
eustlb Merge branch 'main' into add-csm
56506c0b
eustlb remove useless reset_parameters
9d329b64
eustlb use GradientCheckpointingLayer
f5dfa773
eustlb use can_return_tuple
57fcdf22
eustlb formatting
92e26a58
eustlb prepend placeholder in _sample
72df6a69
eustlb torch compile fix
93018727
eustlb some more fixes
ca19f88f
eustlb convert modular
45faaa12
eustlb Merge branch 'main' into add-csm
2ead9ecf
eustlb fix
e5fbb9ff
eustlb default max_length in convert
0ddb8351
eustlb handle depth decoder generation config correctly
d76d9ab3
eustlb
ArthurZucker approved these changes on 2025-04-30
eustlb clearer formulation
adc10c46
eustlb handle output_loading_info
0089658d
eustlb handle softmax warning
9536cff0
eustlb Merge branch 'main' into add-csm
f2467461
eustlb Merge branch 'main' into add-csm
2478ef78
eustlb add doc
27a6a187
eustlb propagate _get_initial_cache_position changes
8d10f9de
eustlb generation in its own module
fe7f971c
eustlb add processor tests
3e9dc637
eustlb
eustlb fix compile with cuda graphs
d017f121
eustlb fix compile with cuda graphs
65447996
eustlb add csm.md
523eabaf
eustlb include CSM loss
d324b0a5
eustlb Merge branch 'main' into add-csm
c275a8b8
eustlb doc nit
73971f45
eustlb Merge branch 'add-csm' of github.com:eustlb/transformers into add-csm
90a9ad73
eustlb doc nit
498edee0
eustlb doc nit
76f5b58c
ArthurZucker approved these changes on 2025-05-02
eustlb Update docs/source/en/model_doc/csm.md
0afbe137
eustlb add save_audio to processor
c528bd67
eustlb Merge branch 'add-csm' of github.com:eustlb/transformers into add-csm
7e7d0ef2
eustlb Update src/transformers/models/csm/modular_csm.py
7148da10
eustlb doc update
c5f4fcba
eustlb simplify audio_codes_mask computation
cabba017
eustlb doc update
e2e86020
eustlb Merge branch 'main' into add-csm
77d9e110
eustlb simplify loss computation
297a6263
eustlb fix static cache test
45fc46ea
eustlb fix
3204df4b
eustlb remove comment
85498292
eustlb Merge branch 'main' into add-csm
b518012a
ArthurZucker approved these changes on 2025-05-05
eustlb Merge branch 'main' into add-csm
4a514d19
eustlb simplify encoded length computation
aa00fa63
eustlb use hf-internal-testing
050907df
eustlb Merge branch 'main' into add-csm
d5a86f6c
eustlb Merge branch 'main' into add-csm
1a71dfec
eustlb doc update
abef3489
eustlb Merge branch 'add-csm' of github.com:eustlb/transformers into add-csm
99995912
eustlb cast to float before numpy
a1fc7174
eustlb nit
42d88a72
eustlb mem efficient codebook head
7bf37b0f
eustlb Merge branch 'main' into add-csm
7b6a275b
eustlb nit
08728439
eustlb Merge branch 'add-csm' of github.com:eustlb/transformers into add-csm
d6272749
eustlb Merge branch 'main' into add-csm
0ae894fc
eustlb cat input values with cutoffs
b4bca5c0
eustlb Merge branch 'main' into add-csm
3d72be35
eustlb merged 798f948e into main 311 days ago
johnwick123f
ArthurZucker
