llama : Support llama 4 text-only #12791
llama4 conversion
79ebef84
initial support, no chat template
b19dbd01
clean up a bit
f6d8e753
fix tokenizer conversion
1fb1888a
correct hparams
869d7d97
try this
6ceae82e
fix shexp
7cfc2373
ffn_inp_normed
edbaaf46
chat template
a518c11e
clean up model conversion
46fe5cbf
add_bos
ab91ab2f
add scale_before_ffn
f9c788df
fix order
e4012e62
Merge branch 'master' into xsn/llama4
2a9b29af
weight_before_ffn
ee06e9b7
ngxson
commented
on 2025-04-07
llm_graph_input_attn_temp
f8f1bd4d
add chunk attn mask
e6a2809c
ngxson
marked this pull request as ready for review 244 days ago
ggerganov
approved these changes
on 2025-04-07
build_inp_attn_scale()
af1968c3
add comment about ggml_repeat
09eba6a5
clarify comments
b28cd9ca
fix build
d3e67f98
ngxson
changed the title from "llama : Support llama 4 text-only (WIP)" to "llama : Support llama 4 text-only" 244 days ago
ngxson
merged
1466621e
into master 244 days ago