transformers
HF <-> megatron checkpoint reshaping and conversion for GPT
#19317
Merged

pacman100
pacman100 HF <-> megatron checkpoint conversion handling reshaping from differe…
b5773bd6
pacman100 requested a review from sgugger 3 years ago
HuggingFaceDocBuilderDev
sgugger
sgugger commented on 2022-10-04
pacman100 Apply suggestions from code review
dc553a75
pacman100 addressing comments
2091e0f9
sgugger
sgugger approved these changes on 2022-10-07
pacman100 add doc strings and 🐛 fixes
28850fb3
pacman100 merged 56af8df3 into main 3 years ago
pacman100 deleted the smangrul/megatron-hf-conversion-utils branch 3 years ago
dumpmemory
lierik
