add PLaMo model #3557

okdshin
okdshin
okdshin commented on 2023-10-09
okdshin okdshin force pushed 2 years ago
ggerganov
okdshin
okdshin
okdshin
cebtenzzre
ggerganov
okdshin
okdshin add plamo mock
feb0966a
okdshin add tensor loading
4c585b4c
okdshin plamo convert
b2330f57
okdshin update norm
9d492365
okdshin able to compile
4a3ef4f2
okdshin fix norm_rms_eps hparam
a22040a8
okdshin runnable
86d5348f
okdshin use inp_pos
f76fd392
okdshin seems ok
ca8f6986
okdshin update kqv code
febc6359
okdshin remove develop code
907b9218
okdshin okdshin force pushed to 907b9218 2 years ago
okdshin update README
9339ffc9
okdshin
ggerganov
ggerganov
okdshin
okdshin shuffle attn_q.weight and attn_output.weight for broadcasting
db1b18dc
okdshin remove plamo_llm_build_kqv and use llm_build_kqv
26340a19
okdshin fix style
700f7c60
okdshin Merge branch 'master' into add_pfnet_plamo_13b_2
602cc71b
okdshin update
307481f2
okdshin
okdshin okdshin requested a review from ggerganov ggerganov 2 years ago
ggerganov llama : remove obsolete KQ_scale
eedd4345
ggerganov plamo : fix tensor names for correct GPU offload
1949c955
ggerganov
ggerganov approved these changes on 2023-12-24
ggerganov ggerganov merged 753be377 into master 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone