llama.cpp
model: try to improve Qwen3 Next
#18683
Merged

model: try to improve Qwen3 Next #18683

ngxson
ngxson qwen3next: simplify qkvz projection
721cbe35
ngxson use ggml_swiglu_split
ed4e9ceb
ngxson revert swiglu_split, but remove redundant repeat()
efc312fc
ngxson ngxson requested a review from CISC CISC 91 days ago
ngxson ngxson removed review request from CISC CISC 91 days ago
ngxson ngxson requested a review from pwilkin pwilkin 91 days ago
ngxson ngxson marked this pull request as draft 91 days ago
pwilkin
ngxson
jeffbolznv
ngxson
pwilkin
github-actions github-actions added model
github-actions github-actions added python
jacekpoplawski
am17an
jeffbolznv
IIIIIllllIIIIIlllll
ggerganov
ngxson
ngxson
ngxson
pwilkin
ngxson
ngxson fix missing reshape
c77001f2
jeffbolznv
ggerganov
ngxson
danbev
lemmi
jeffbolznv
ngxson
ngxson Merge branch 'master' into xsn/qwen3next_improve
033fd273
lemmi
ngxson rm 2 redundant transposes
d96eb69e
ngxson
lemmi
jeffbolznv
danbev
ngxson
ggerganov
ggerganov commented on 2026-01-10
ngxson move mul_mat(k,q) to outside of chunking
2a39955a
ngxson rm redundant cont
e1f8ad25
ngxson improve g_cs_chunk
939767c3
ngxson add comments about no cont
f38fc605
ngxson use std::pair instead of ggml_concat
f8ad742a
ngxson
lemmi
CISC
CISC commented on 2026-01-10
IIIIIllllIIIIIlllll
ngxson vectorize key_gdiff calculation
5ec140e0
ngxson rm unused tensor
9299ced6
ngxson
ngxson commented on 2026-01-10
jeffbolznv
ngxson avoid ggml_concat inside loop
e41d9100
ngxson bring back ggml_concat as it may not work on other backend
329112c5
ngxson
ngxson nits
d5a08569
ngxson
ngxson ngxson marked this pull request as ready for review 88 days ago
ngxson ngxson requested a review from ggerganov ggerganov 88 days ago
ngxson ngxson requested a review from CISC CISC 88 days ago
IIIIIllllIIIIIlllll
jeffbolznv
jeffbolznv
danbev
jacekpoplawski
CISC
CISC approved these changes on 2026-01-11
ngxson ngxson merged 506bb6e0 into master 87 days ago
ngxson
SirSchnobi
Som-anon
ngxson
danielhanchen
danielhanchen
bartowski1182

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone