transformers
More appropriate cuda warmup in resource-constrained hardware
#37550
Merged

More appropriate cuda warmup in resource-constrained hardware #37550

Cyrilvallez merged 3 commits into main from lower-cache-allocation
Cyrilvallez
Cyrilvallez better allocation in resource constrained env
3c858e81
Cyrilvallez Update modeling_utils.py
c28678f8
github-actions github-actions marked this pull request as draft 1 year ago
github-actions
Cyrilvallez Cyrilvallez marked this pull request as ready for review 1 year ago
Cyrilvallez CIs
551ab4a5
HuggingFaceDocBuilderDev
Cyrilvallez Cyrilvallez merged 7dafcd00 into main 1 year ago
Cyrilvallez Cyrilvallez deleted the lower-cache-allocation branch 1 year ago
qubvel
qubvel commented on 2025-04-16

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone