handle offload_state_dict when initing transformers models #12438
handle offload_state_dict when initing transformers models
447e8322
Merge branch 'main' into fix-transformers-model-init
38a5fff5
DN6 approved these changes on 2025-10-07
sayakpaul merged 2d69bacb into main 250 days ago
sayakpaul deleted the fix-transformers-model-init branch 242 days ago