Fixed Llama-3_1-Nemotron-51B doesn't work when 4K or more tokens #11008
conflict resolution
ecad9665
Merge branch 'ggerganov:master' into master
12aded6c
move comments after bracket to its own line
643e5e8a
Merge branch 'ggerganov:master' into master
e68c76d1
Merge branch 'ggerganov:master' into master
6a4805f8
Merge branch 'ggerganov:master' into master
f9a1cdb3
Merge branch 'ggerganov:master' into master
c1736f30
DeciLMCausalModel now reads rope_theta from config.json properly
984ffac2
slaren
approved these changes
on 2024-12-31
ggerganov
approved these changes
on 2024-12-31
ggerganov
merged
bc7b1f86
into master 342 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub