llama.cpp
llama : add llm_build helper functions
#3848
Merged

llama : add llm_build helper functions #3848

ggerganov merged 18 commits into llama-refactor from llama-refactor-norm
ggerganov
ggerganov ggerganov added refactoring
ggerganov ggerganov added need feedback
ggerganov llama : add llm_build_norm helper function
7db9c96d
ggerganov ggerganov force pushed from 66928991 to 7db9c96d 2 years ago
slaren
ggerganov
ggerganov llama : add llm_build_ffn helper function (#3849)
dbf836bb
ggerganov ggerganov changed the title llama : add llm_build_norm helper function llama : add llm_build helper functions 2 years ago
monatis
ggerganov llama : add llm_build_k_shift helper
38728a0b
ggerganov ggerganov force pushed from c6ae530e to 38728a0b 2 years ago
ggerganov llama : fix offloading after recent changes
909d6447
ggerganov ggerganov force pushed from 61ca4777 to c82880d2 2 years ago
ggerganov ggerganov force pushed from c82880d2 to 85ec6b02 2 years ago
ggerganov ggerganov force pushed from 85ec6b02 to ca15e4ac 2 years ago
ggerganov llama : add llm_build_kv_store helper
3e046259
ggerganov ggerganov force pushed from ca15e4ac to 3e046259 2 years ago
ggerganov llama : remove obsolete offload names
59908619
ggerganov llama : fix llm_build_k_shift to use n_head_kv instead of n_head
31a12f3d
ggerganov llama : simplify falcon Q, K, V computation
a104abea
ggerganov llama : remove obsolete comments in build graphs
c9121fdd
ggerganov llama : add llm_build_kqv helper
f39e6075
ggerganov ggerganov force pushed from d82a5c02 to f39e6075 2 years ago
ggerganov ggerganov marked this pull request as ready for review 2 years ago
ggerganov llama : minor
792d1a1b
ggerganov ggerganov force pushed from 4d115ea1 to 792d1a1b 2 years ago
KerfuffleV2
ggerganov llama : add LLAMA_OFFLOAD_DEBUG + fix starcoder offloading
a3f80013
ggerganov
KerfuffleV2
KerfuffleV2 commented on 2023-10-30
ggerganov
KerfuffleV2
ggerganov llama : fix input allocation logic
2926ef63
ggerganov llama : update offload functions for KQ tensors
6669cd83
ggerganov
ggerganov llama : normalize tensor names
0bfdcdd0
ggerganov llama : enable warning about not offloaded tensors
fc5a26aa
ggerganov ggerganov requested a review from slaren slaren 2 years ago
ggerganov ggerganov requested a review from KerfuffleV2 KerfuffleV2 2 years ago
slaren
slaren commented on 2023-10-31
slaren
slaren commented on 2023-10-31
slaren
slaren commented on 2023-10-31
slaren
slaren commented on 2023-10-31
slaren
slaren
slaren commented on 2023-10-31
KerfuffleV2
KerfuffleV2 commented on 2023-10-31
ggerganov llama : remove extra ; + deduplicate gate_b logic
2073347e
ggerganov llama : add llm_build_inp_embd helper
7923b70c
ggerganov
KerfuffleV2
ggerganov ggerganov merged 5baefef4 into llama-refactor 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone