transformers
Introduce GradientCheckpointingLayer
#37223
Merged

Introduce GradientCheckpointingLayer #37223

qubvel
qubvel GradientCheckpointingLayer
486f1553
qubvel qubvel marked this pull request as ready for review 364 days ago
github-actions github-actions requested a review from ArthurZucker ArthurZucker 364 days ago
github-actions github-actions requested a review from Cyrilvallez Cyrilvallez 364 days ago
qubvel trigger
e5d326a4
qubvel
github-actions
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker approved these changes on 2025-04-04
qubvel Move GC layer to a separate file
70dc32e7
qubvel Update import
9f8c1ce4
qubvel Expose and document GC layer
fc96fadb
qubvel Merge branch 'main' into gradient-checkpointing-layer
657c538a
qubvel Merge branch 'main' into gradient-checkpointing-layer
5a7dd6b8
qubvel Fix dummy
72baa13c
qubvel Apply to llama-based models
31f67201
qubvel Update modulars
334043a3
qubvel Update a few more models for consistency
d43dfd43
qubvel
qubvel qubvel requested a review from ArthurZucker ArthurZucker 358 days ago
qubvel Merge branch 'main' into gradient-checkpointing-layer
da0de606
qubvel Update glm4
23e6e24e
ArthurZucker
ArthurZucker approved these changes on 2025-04-11
qubvel Merge branch 'main' into gradient-checkpointing-layer
2093264a
qubvel Merge branch 'main' into gradient-checkpointing-layer
2e913fd9
qubvel Update Janus
4640e46b
qubvel qubvel merged 9167fada into main 344 days ago
sfc-gh-sbekman

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone