HF <-> megatron checkpoint reshaping and conversion for GPT #19317
HF <-> megatron checkpoint conversion handling reshaping from differe…
b5773bd6
Apply suggestions from code review
dc553a75
addressing comments
2091e0f9
sgugger
approved these changes
on 2022-10-07
add doc strings and 🐛 fixes
28850fb3
pacman100
merged
56af8df3
into main 3 years ago
pacman100
deleted the smangrul/megatron-hf-conversion-utils branch 3 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub