llama.cpp
86f5623d - llama : fix MiniCPM inference after Granite Four changes (#14850)

Commit

165 days ago

llama : fix MiniCPM inference after Granite Four changes (#14850) MiniCPM models use the llm_build_granite constructor which was changed in the Granite Four PR to use hparams.rope_finetuned instead of a use_rope parameter. MiniCPM models need rope enabled by default. Fixes inference from gibberish to correct responses.

References

#14850 - fix: restore MiniCPM inference after Granite Four changes

Author

jk3456a

Parents

39cffdf1

llama.cpp 86f5623d - llama : fix MiniCPM inference after Granite Four changes (#14850)

llama.cpp
86f5623d - llama : fix MiniCPM inference after Granite Four changes (#14850)