Implement the OLMo architecture #6741
implement olmo architecture
29a76704
remove unused variable
d3040271
remove unused moe branch
5c82dae6
remove check for weight
9855bb6d
remove superfluous moe, bias and rope tensors
5993a97e
clarified comment
cc32c739
fix clamp_kqv setting
f6c99ce0
ggerganov
approved these changes
on 2024-04-19
phymbert
approved these changes
on 2024-04-19
remove obsolete parameter name filter
a71963d1
phymbert
merged
9958c81b
into master 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub