accelerate
Give example on how to handle gradient accumulation with cross-entropy
#3193
Merged

Give example on how to handle gradient accumulation with cross-entropy #3193

ylacombe
ylacombe Add cross-entropy example in the gradient accumulation docs
2eb684e9
ylacombe add example of logs
4d4ed806
HuggingFaceDocBuilderDev
ylacombe correct skeleton code
3b8c8872
muellerzr
muellerzr approved these changes on 2024-10-24
muellerzr
muellerzr commented on 2024-10-24
ylacombe replace gather_for_metrics with gather
c01827c1
ylacombe batch_size -> per_device_batch_size
22cbf9c5
ylacombe remove main_process_only=True
395c572d
SunMarc
SunMarc commented on 2024-10-24
ylacombe add autoregressive example in examples/
2e80bf03
ylacombe Update docs/source/usage_guides/gradient_accumulation.md
5e3e8118
ylacombe ruff format
c56c7802
ylacombe add grad accum test
80c720a9
ylacombe update docs
e5d2c50b
muellerzr
muellerzr commented on 2024-10-30
muellerzr
muellerzr commented on 2024-10-30
github-actions
JaheimLee
ylacombe Update examples/by_feature/gradient_accumulation_for_autoregressive_m…
0e1bb896
ylacombe update tests
cc8bcc88
ylacombe ylacombe requested a review from muellerzr muellerzr 1 year ago
muellerzr
muellerzr approved these changes on 2024-12-11
muellerzr muellerzr requested a review from SunMarc SunMarc 1 year ago
SunMarc
SunMarc approved these changes on 2024-12-24
SunMarc SunMarc merged acfbf72a into main 1 year ago
JaheimLee
SunMarc
JaheimLee
SunMarc

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone