llama.cpp
feat: support StarCoder model architectures
#3187
Merged

feat: support StarCoder model architectures #3187

wsxiaoys
wsxiaoys add placeholder of starcoder in gguf / llama.cpp
0c5d4d87
wsxiaoys support convert starcoder weights to gguf
eb7f0eba
wsxiaoys convert MQA to MHA
76d32cca
wsxiaoys fix ffn_down name
7e0a843b
wsxiaoys add LLM_ARCH_STARCODER to llama.cpp
7298c37e
wsxiaoys set head_count_kv = 1
166a259f
wsxiaoys load starcoder weight
57f064d7
wsxiaoys add max_position_embeddings
a17ef397
wsxiaoys set n_positions to max_positioin_embeddings
26836119
wsxiaoys properly load all starcoder params
77c7ec17
wsxiaoys fix head count kv
0be15e16
wsxiaoys fix comments
dac31da4
wsxiaoys fix vram calculation for starcoder
4420cff6
wsxiaoys store mqa directly
ab13d071
wsxiaoys add input embeddings handling
8bc76a22
ggerganov ggerganov added model
ggerganov
wsxiaoys add TBD
101c5787
wsxiaoys
wsxiaoys working in cpu, metal buggy
a1cf66ea
wsxiaoys
wsxiaoys cleanup useless code
6c353dc7
ggerganov metal : fix out-of-bounds access in soft_max kernels
f82328ab
ggerganov llama : make starcoder graph build more consistent with others
92a4f868
ggerganov
wsxiaoys Merge pull request #2 from ggerganov/support-starcoder-fix
caa72209
wsxiaoys
wsxiaoys refactor: cleanup comments a bit
57eaa39c
wsxiaoys add other starcoder models: 3B, 7B, 15B
5ca037b9
wsxiaoys wsxiaoys marked this pull request as ready for review 1 year ago
wsxiaoys
wsxiaoys support-mqa-directly
08f35c46
wsxiaoys Merge pull request #3 from TabbyML/support-starcoder-mqa
e1fa9dd2
wsxiaoys
Green-Sky
Green-Sky commented on 2023-09-15
wsxiaoys fix: remove max_position_embeddings, use n_train_ctx
f989ba15
ggerganov
ggerganov commented on 2023-09-15
wsxiaoys Update llama.cpp
bb9931cf
ggerganov
ggerganov approved these changes on 2023-09-15
wsxiaoys Update llama.cpp
eafcc34f
ggerganov
ggerganov commented on 2023-09-15
wsxiaoys Apply suggestions from code review
e30ad714
wsxiaoys fix: switch to space from tab
72a72854
ggerganov ggerganov merged 4fe09dfe into master 1 year ago
wsxiaoys wsxiaoys deleted the support-starcoder branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone