llama.cpp
7bed317f - models : fix the attn_factor for mistral3 graphs + improve consistency (#17945)

Commit
4 days ago
models : fix the attn_factor for mistral3 graphs + improve consistency (#17945) * models : fix the attn_factor for mistral3 graphs * cont : rework attn_factor correction logic * cont : make deepseek2 consistent * cont : add TODO * cont : special-case DSv2 * cont : revert Mistral 3 Large changes * cont : fix DS2 to use the original attn_factor * cont : minor comments
Author
Parents
Loading