llama.cpp
9958c81b - Implement the OLMo architecture (#6741)

Commit

1 year ago

Implement the OLMo architecture (#6741)

* implement olmo architecture
* remove unused variable
* remove unused moe branch
* remove check for weight
* remove superfluous moe, bias and rope tensors
* clarified comment
* fix clamp_kqv setting
* remove obsolete parameter name filter

References

#6741 - Implement the OLMo architecture

Author

nopperl

Parents

8b1b1f49

Files (4)

README.md
convert-hf-to-gguf.py
gguf-py/gguf/constants.py
llama.cpp
