llama.cpp
llama : use n_embd_head_v instead of n_embd_head_k when reshaping kqv
#7327
Merged
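
A minimal sketch of the shape reasoning the title suggests, assuming kqv is the product of V with the softmaxed attention scores, so that each head contributes n_embd_head_v values per token. Only n_embd_head_k and n_embd_head_v come from the PR title. The struct, the helper functions, and the n_head and n_tokens fields are hypothetical names for illustration, not llama.cpp APIs.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical illustration only: none of these types or helpers exist in llama.cpp.
struct AttnShape {
    int64_t n_embd_head_k; // per-head K (and Q) dimension
    int64_t n_embd_head_v; // per-head V dimension
    int64_t n_head;        // number of attention heads (assumed name)
    int64_t n_tokens;      // number of tokens in the batch (assumed name)
};

// Element count of kqv under the stated assumption:
// every head produces n_embd_head_v values for every token.
inline int64_t kqv_elements(const AttnShape & s) {
    return s.n_embd_head_v * s.n_head * s.n_tokens;
}

// Row width of the 2D tensor obtained by concatenating all heads per token.
inline int64_t merged_row_width(const AttnShape & s) {
    return s.n_embd_head_v * s.n_head;
}

// A reshape to [row_width, n_tokens] is only valid if it preserves the element count.
inline bool reshape_is_valid(const AttnShape & s, int64_t row_width) {
    assert(s.n_tokens > 0);
    return row_width * s.n_tokens == kqv_elements(s);
}
```

When the two head sizes are equal, a row width of n_embd_head_k * n_head happens to give the same result, which would explain how such a mix-up could go unnoticed. With unequal head sizes (for example the illustrative values 192 and 128), only the n_embd_head_v width preserves the element count.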

fairydreaming
sszymczy llama : use n_embd_head_v instead of n_embd_head_k when reshaping kqv
f15e933f
mofosyne added the bugfix label
mofosyne added the Review Complexity : Medium label
sszymczy llama : use n_embd_v_gqa and n_embd_head_v instead of n_embd_k_gqa an…
886f89da
fairydreaming
ggerganov
ggerganov approved these changes on 2024-05-17
fairydreaming
ggerganov merged 27b04069 into master 1 year ago
fairydreaming deleted the llm_build_kqv_fix branch 228 days ago
