transformers
[Trainer] accelerate contextparallel support in trainer
#40205
Merged

SunMarc merged 33 commits into main from trainer-cp
kashif
kashif initial context_parallel_size support in trainer
ca8e3663
kashif Merge branch 'main' into trainer-cp
29b22cf7
SalmanMohammadi commented on 2025-08-15
SalmanMohammadi commented on 2025-08-15
kashif For context parallelism, use AVG instead of SUM to avoid over-account…
d70fef40
kashif git pushMerge branch 'trainer-cp' of https://github.com/huggingface/t…
dd58dd01
kashif use parallelism_config.cp_enabled
ecc23661
HuggingFaceDocBuilderDev
kashif add parallelism_config to trainer state
a629ff0a
winglian commented on 2025-08-15
SalmanMohammadi commented on 2025-08-15
kashif warn when auto-enabling FSDP
66d4273e
S1ro1 commented on 2025-08-15
S1ro1 commented on 2025-08-15
S1ro1 commented on 2025-08-15
S1ro1 commented on 2025-08-15
S1ro1 commented on 2025-08-15
kashif fix some reviews
ffa4699c
S1ro1 WIP: somewhat matching loss
361f122f
S1ro1 commented on 2025-08-18
S1ro1 commented on 2025-08-18
S1ro1 commented on 2025-08-18
S1ro1 commented on 2025-08-18
S1ro1
kashif Merge branch 'main' into trainer-cp
3d426c16
kashif Merge branch 'main' into trainer-cp
3efe69bf
S1ro1 Feat: add back nested_gather
eca52ac0
S1ro1 Feat: cleanup
951527b1
S1ro1 Fix: raise on non-sdpa attn
412c15e2
S1ro1 Merge branch 'main' into trainer-cp
be60c40f
kashif remove context_parallel_size from TrainingArguments
2c357aab
S1ro1 commented on 2025-08-19
kashif if we have parallelism_config, we defer to get_state_dict from accele…
71e082f8
kashif Merge branch 'main' into trainer-cp
37e6fdfe
SunMarc commented on 2025-08-21
kashif Merge branch 'main' into trainer-cp
4f6fe153
kashif fix from review
485d7fab
kashif
S1ro1 commented on 2025-08-22
S1ro1 Feat: add parallelism config support
3d16def7
S1ro1 Chore: revert some unwanted formatting changes
25a308e8
S1ro1 Fix: check None
6d413651
S1ro1 Check none 2
d82022c9
S1ro1 Fix: remove duplicate import
ae9f878b
S1ro1 force pushed from 94c58b58 to ae9f878b 120 days ago
S1ro1 Merge branch 'main' into trainer-cp
531924e2
S1ro1
SunMarc Merge branch 'main' into trainer-cp
64d73362
SunMarc approved these changes on 2025-08-22
SunMarc requested a review from winglian 120 days ago
S1ro1 Update src/transformers/trainer.py
52cb3bc9
S1ro1 Update src/transformers/training_args.py
6e9fb308
kashif Merge branch 'main' into trainer-cp
bf187b22
S1ro1 Fin
33817f37
SunMarc approved these changes on 2025-08-25
SunMarc commented on 2025-08-25
kashif
kashif require accelerate 1.10.1 and higher
22945068
SunMarc approved these changes on 2025-08-26
SunMarc Merge branch 'main' into trainer-cp
f956348d
SunMarc enabled auto-merge (squash) 116 days ago
SunMarc merged 6d2bb1e0 into main 116 days ago
SunMarc deleted the trainer-cp branch 116 days ago
sfc-gh-sbekman
kashif
