llama.cpp
model : add grok-2 support
#15539
Merged

CISC merged 32 commits into master from cisc/grok-2
CISC
CISC add grok-2 support
551a64f2
github-actions added python
CISC type fix
301ba771
CISC added model
CISC type fix
cf87c766
CISC type fix
f582b845
CISC "fix" vocab for invalid sequences
711ab174
CISC fix expert tensor mapping and spaces in vocab
8edece83
CISC
nicoboss
CISC add chat template
3ef6cf57
CISC
CISC
nicoboss
CISC
CISC fix norm tensor mapping
25e4e5f0
CISC rename layer_out_norm to ffn_post_norm
4a53f132
nicoboss
CISC
CISC ensure ffn_post_norm is mapped
e0a0024e
CISC fix experts merging
d7efed89
nicoboss
pwilkin
CISC
CISC remove erroneous FFN_GATE entry
92266e96
danielhanchen
nicoboss
CISC
CISC concatenate split tensors and add more metadata
6b3f7755
CISC
nicoboss
CISC
CISC process all expert layers and try cat instead of hstack
c5566638
CISC
nicoboss
CISC
CISC add support for community BPE vocab
9f868763
CISC fix expert feed forward length and ffn_down concat
5d4e4073
CISC
CISC commit this too
3e83c648
JohannesGaessler
CISC
JohannesGaessler
CISC
nicoboss
CISC
nicoboss
CISC
CISC add ffn_up/gate/down, unsure if sequence is right
b1627ce5
nicoboss
CISC add ffn_gate/down/up to tensor names
00481afe
CISC
nicoboss
CISC
CISC correct residual moe (still not working)
2e8b67b0
CISC mess--
94bcbbfe
nicoboss
CISC
CISC fix embedding scale being applied twice
b7675ea0
CISC
CISC add built in chat template
6cf16aaf
CISC
ggerganov
slaren
CISC
jacekpoplawski
CISC
pwilkin
CISC
CISC
jacekpoplawski
CISC
CISC change beta fast for grok if default value
4abde12c
CISC remove spm vocab in favor of community bpe vocab
705f84a7
CISC change attention temp length metadata type to integer
a8fa83f2
CISC update attention temp length metadata
05b52fa5
CISC remove comment
b7bfc9a6
CISC Merge branch 'master' into cisc/grok-2
c0d755cd
CISC marked this pull request as ready for review 187 days ago
CISC
CISC requested a review from slaren 187 days ago
slaren
slaren commented on 2025-09-03
CISC replace M_SQRT2 with std::sqrt(2)
0408a4fa
jacekpoplawski
shimmyshimmer
CISC Merge branch 'master' into cisc/grok-2
ed4d8f22
CISC
CISC add yarn metadata, move defaults to hparams
d032a1b0
CISC requested a review from ggerganov 177 days ago
ggerganov
ggerganov approved these changes on 2025-09-13
CISC merged b8e09f08 into master 175 days ago
CISC deleted the cisc/grok-2 branch 175 days ago
