llama.cpp
llama: add initial support for Falcon-H1 model family
#14534
Merged

llama: add initial support for Falcon-H1 model family #14534

CISC merged 112 commits into ggml-org:master from tiiuae:add-fh1-rebased
ibrahimkhadraoui
younesbelkada v1
991de6cb
younesbelkada push more fixes
f897efda
younesbelkada another fix
71a6848e
younesbelkada fix
03568c93
younesbelkada more fixes
0c93ef6a
younesbelkada minor fix
fdd5cff4
younesbelkada more cleaning on python code
14c37ec0
ibrahimkhadraoui python fixes
8bea9226
ibrahimkhadraoui changed precision for multipliers float 32->64
071f4b7f
younesbelkada fixes
50eadc7b
younesbelkada merge
a39a8423
younesbelkada another fix
1415cd87
younesbelkada fix
243e4d1a
younesbelkada pre-norm -> norm
cce35498
younesbelkada fix
22de62cf
ibrahimkhadraoui Revert "fix"
2fe057cc
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
d22b4ea4
younesbelkada fix
6c7d9e26
ibrahimkhadraoui small fix ffn_norm
15138df4
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
a6d0067d
younesbelkada try
1fd0574a
younesbelkada mix instead of max
250b4f10
ibrahimkhadraoui fix vocab size
3ee79839
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
2aa48dd8
ibrahimkhadraoui conflict solve
9760c8bc
ibrahimkhadraoui fixed multipliers
7a25441e
github-actions github-actions added python
jacekpoplawski
ibrahimkhadraoui
jacekpoplawski
compilade
compilade commented on 2025-07-04
ibrahimkhadraoui
ggerganov
ggerganov commented on 2025-07-05
ibrahimkhadraoui falcon-h1 specefic vocab resolved
280dd2dc
ibrahimkhadraoui read arch from gguf.MODEL_ARCH
c56ec07a
ibrahimkhadraoui mamba_d_ssm added to d_inner find_hparam
c4af0f3c
ibrahimkhadraoui remove unused functions from gguf_writer.py
53304c84
ibrahimkhadraoui override modify_tensors instead of get_tensors
441d8d66
younesbelkada fix conversion and d_inner
6c39e775
ibrahimkhadraoui added some cb functions for debugging puposes
8c508938
ibrahimkhadraoui inp_out_ids moved outside of layers loop
49d74209
ibrahimkhadraoui mup_vec create as float64
97011d7a
ibrahimkhadraoui fix rope_theta
286e1fa5
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
b3bc1fb2
younesbelkada injected mup
a9f3a63d
younesbelkada clean ups
e96cc733
ibrahimkhadraoui Merge pull request #1 from tiiuae/injected-mup
3afb2a89
ibrahimkhadraoui rm extra space
0ad35028
ibrahimkhadraoui rm unused MAMBA_CHUNK_SIZE
53446f7e
ibrahimkhadraoui rm unused key
ae937f44
ibrahimkhadraoui add bos False
b6df0a49
ibrahimkhadraoui changed ROPE_TYPE
935d46fa
ibrahimkhadraoui cleaning debugging stuff
624699c5
ibrahimkhadraoui cleaning debug quant
042e5ff9
younesbelkada fix comment
f74e266f
younesbelkada some cleanups
632861e6
younesbelkada some cleanups
084873c2
younesbelkada
younesbelkada commented on 2025-07-07
younesbelkada Update src/llama-model-loader.cpp
fd203302
younesbelkada more cleanups
68cb7845
younesbelkada moe cleanuips
d2f46f18
ibrahimkhadraoui ibrahimkhadraoui requested a review from ggerganov ggerganov 252 days ago
ibrahimkhadraoui ibrahimkhadraoui requested a review from compilade compilade 252 days ago
ibrahimkhadraoui ibrahimkhadraoui marked this pull request as ready for review 252 days ago
ibrahimkhadraoui
ggerganov
ggerganov commented on 2025-07-07
ggerganov
ggerganov commented on 2025-07-07
compilade
compilade commented on 2025-07-07
younesbelkada d_ssm -> d_inner;
7d7da0b3
ibrahimkhadraoui cleaning unused hparams
67b26642
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
da8a3385
ibrahimkhadraoui cleanup
e63ee464
younesbelkada more cleanups
d473d428
younesbelkada more cleanups on python conversion;
8555ee8b
ibrahimkhadraoui minor cleanups
7846c67e
younesbelkada Apply suggestions from code review
2dee7cf9
younesbelkada remove todo
a846d023
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
f028a43a
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
d41f1114
ibrahimkhadraoui added falcon-h1
f266d145
younesbelkada tensor not required
4bc9e0ca
ibrahimkhadraoui clean
2834a4ac
ibrahimkhadraoui ibrahimkhadraoui requested a review from ggerganov ggerganov 251 days ago
ibrahimkhadraoui ibrahimkhadraoui requested a review from younesbelkada younesbelkada 251 days ago
ibrahimkhadraoui
younesbelkada remove unneeded attributes
823696ba
younesbelkada more cleanups and fixed conversion
adff470c
ggerganov
ggerganov commented on 2025-07-08
younesbelkada remove final_norm
097df0ed
ggerganov
ggerganov approved these changes on 2025-07-08
ibrahimkhadraoui flake8 fixes
9a048d8d
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
52d1ef35
CISC
CISC commented on 2025-07-08
younesbelkada Update src/llama-model.cpp
58e3866d
younesbelkada Merge branch 'master' into add-fh1-rebased
d28c31a9
ibrahimkhadraoui flake8 fixes
9b926483
CISC
CISC requested changes on 2025-07-08
ibrahimkhadraoui Update src/llama-hparams.cpp
7fe1794c
ibrahimkhadraoui Update src/llama-model.cpp
40058c04
ibrahimkhadraoui Update src/llama-model.cpp
debf4e5d
ibrahimkhadraoui Update src/llama-arch.cpp
212edffd
ibrahimkhadraoui Update convert_hf_to_gguf.py
90ddf241
CISC
CISC commented on 2025-07-08
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
7edf3800
ibrahimkhadraoui added hashes
c3c5d51c
ggerganov
ggerganov commented on 2025-07-08
younesbelkada Update src/llama-arch.cpp
f8d7c970
younesbelkada Update src/llama-vocab.cpp
4610ee20
younesbelkada update the update file
082ab4ad
younesbelkada Revert "update the update file"
c5515e36
younesbelkada fix: address suggestions
1ef53b31
younesbelkada fix: update convert_hf_to_gguf.py
d5efbd01
CISC
CISC commented on 2025-07-08
younesbelkada Update gguf-py/gguf/constants.py
a5afc8bc
younesbelkada
CISC
CISC commented on 2025-07-08
younesbelkada Update src/llama-model-loader.cpp
99f9a3dd
ibrahimkhadraoui d_inner fixed
c3c64c35
CISC
CISC commented on 2025-07-08
younesbelkada Update src/llama-model.cpp
63e3afc9
CISC
CISC approved these changes on 2025-07-08
compilade
compilade commented on 2025-07-08
compilade
compilade commented on 2025-07-08
ibrahimkhadraoui reshaping ssm_norm for 34B
d7585783
ibrahimkhadraoui Merge branch 'add-fh1-rebased' of https://github.com/tiiuae/llama.cpp…
8972c15f
ibrahimkhadraoui removing generate_mup
7897c217
ibrahimkhadraoui remove duplicates metadata keys
6403caae
ibrahimkhadraoui rm comment
710630a5
compilade
compilade commented on 2025-07-08
younesbelkada Merge branch 'master' into add-fh1-rebased
7b9aa7ba
younesbelkada final comment
ecc5253d
younesbelkada fix unused args
bbca33eb
younesbelkada
younesbelkada fix constants
9f514e39
CISC
CISC commented on 2025-07-08
younesbelkada fix bad merge
34c5d830
compilade
compilade commented on 2025-07-08
younesbelkada Update src/llama-model.cpp
521e8235
younesbelkada falcon-h1: remove unused ssm_in_b and bad merge
6943f4ea
compilade
compilade approved these changes on 2025-07-08
CISC
CISC commented on 2025-07-08
younesbelkada Update src/llama-model.cpp
4d2c94b7
younesbelkada falcon-h1: fix last comment
b7c9a995
CISC
CISC commented on 2025-07-08
compilade
compilade requested changes on 2025-07-08
younesbelkada Update convert_hf_to_gguf.py
9fd308d8
younesbelkada falcon-h1: revert add_add_bos(False)
51f50bfb
compilade
compilade approved these changes on 2025-07-09
younesbelkada
compilade
younesbelkada
younesbelkada
CISC
CISC commented on 2025-07-09
younesbelkada falcon-h1: fix tied weights
367d8c58
younesbelkada falcon-h1: remove whitespace
1fa361bb
younesbelkada falcon-h1: fix wrong size param
6dde986a
younesbelkada falcon-h1: fix whitespace issues
94ab3a88
CISC CISC merged 04655063 into master 250 days ago
CISC
CISC
younesbelkada
gabe-l-hart
gabe-l-hart commented on 2025-07-09
ggerganov ggerganov added hot

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone