llama.cpp
ggml : implement REGLU/GEGLU/SWIGLU ops
#14158
Merged

ggml : implement REGLU/GEGLU/SWIGLU ops #14158

CISC merged 23 commits into master from cisc/unary-reglu-geglu-swiglu
CISC
CISC CISC added help wanted
CISC CISC requested a review from ggerganov ggerganov 200 days ago
github-actions github-actions added testing
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
ggerganov
ggerganov commented on 2025-06-13
CISC CISC changed the title ggml : implement unary REGLU/GEGLU/SWIGLU ops ggml : implement REGLU/GEGLU/SWIGLU ops 200 days ago
CISC CISC requested a review from ggerganov ggerganov 200 days ago
ggerganov
ggerganov approved these changes on 2025-06-13
ggerganov ggerganov requested a review from JohannesGaessler JohannesGaessler 200 days ago
ggerganov
JohannesGaessler
JohannesGaessler commented on 2025-06-13
JohannesGaessler
CISC
github-actions github-actions added Apple Metal
ngxson
ngxson commented on 2025-06-13
JohannesGaessler
0cc4m
JohannesGaessler
JohannesGaessler commented on 2025-06-13
JohannesGaessler
ggerganov
jeffbolznv
CISC
CISC
ngxson
ngxson commented on 2025-06-13
github-actions github-actions added Vulkan
0cc4m
qnixsynapse
CISC
qnixsynapse
github-actions github-actions added SYCL
CISC
qnixsynapse
CISC
CISC
CISC
sgeor255
sgeor255 approved these changes on 2025-06-17
ggerganov
CISC
jeffbolznv
jeffbolznv
CISC
0cc4m
ngxson
CISC
CISC
jeffbolznv
CISC
CISC CISC marked this pull request as draft 195 days ago
CISC CISC changed the title ggml : implement REGLU/GEGLU/SWIGLU ops ggml : implement op fusion, starting with REGLU/GEGLU/SWIGLU 195 days ago
CISC
ggerganov
jeffbolznv
ggerganov
jeffbolznv
ngxson
JohannesGaessler
jeffbolznv
ggerganov
CISC
slaren
CISC
slaren
ggerganov
slaren
ggerganov
jeffbolznv
qnixsynapse
CISC implement unary REGLU/GEGLU/SWIGLU cpu ops
76c9bc17
CISC relax constraints
5a490f07
CISC duplicate shape of source
56c79931
CISC fix ggml_vec_geglu_f16
21c4963b
CISC special case gated ops
bb2fda70
CISC implement unary REGLU/GEGLU/SWIGLU cuda ops
a1a7b6df
CISC tighten constraints again
f8c20809
CISC refactor into GGML_GLU_OP
a341aa3c
ggerganov metal : add glu kernels
d9ddeb9d
CISC add CUDA_GLU_BLOCK_SIZE [no ci]
cfa9c7a4
CISC more constraints and use 64bit ints
70e8b48e
CISC 64bit multiplication [no ci]
f8705a23
CISC implement swapped variants (cpu/cuda)
0b2703fc
CISC update comment [no ci]
d5934297
0cc4m Vulkan: Add GLU ops and shaders
34d1aeda
qnixsynapse SYCL: Implement fused kernel GEGLU, SWIGLU and REGLU for single up+gate
a9aedf46
CISC ggml : implement GLU for split up/gate (#14181)
35dacd1a
qnixsynapse GGML: increase OP count in assertion
a234e09f
qnixsynapse qnixsynapse force pushed from c2af58b0 to a234e09f 191 days ago
CISC
qnixsynapse Refactor: Optimize SYCL element-wise operations with unary function i…
ab46d11d
CISC
qnixsynapse
jeffbolznv
CISC
CISC
slaren
jeffbolznv
jeffbolznv
jeffbolznv vulkan: Increase workgroup size for GLU, for performance (#14345)
650d3987
CISC merge fix
8bca3e2a
CISC CISC changed the title ggml : implement op fusion, starting with REGLU/GEGLU/SWIGLU ggml : implement REGLU/GEGLU/SWIGLU ops 185 days ago
CISC
CISC Merge branch 'master' into cisc/unary-reglu-geglu-swiglu
a6b641e0
ggerganov
ggerganov commented on 2025-06-29
ggerganov metal : add support for split and swap
9054f1a0
ggerganov
ggerganov approved these changes on 2025-06-29
CISC CISC marked this pull request as ready for review 184 days ago
CISC CISC merged a0535ffa into master 184 days ago
CISC CISC deleted the cisc/unary-reglu-geglu-swiglu branch 184 days ago
CISC
markg85
0cc4m
markg85
slaren
CISC
CISC
markg85
slaren
CISC
markg85
slaren
markg85
markg85
slaren
markg85
markg85
slaren

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone