ggerganov/llama.cpp
Pull Requests
Fix bitwise operation between different enumeration types warning
examples
#24229 by
whoozle
was closed 2026-06-06 11:19
Ljy/ops 0604
documentation
script
testing
examples
python
ggml
Apple Metal
OpenCL
#24227 by
DDra961
was closed 2026-06-06 09:11
completion : remove useless statics
examples
#24226 by
angt
was merged 2026-06-06 10:16
ggml-cuda : use int64 for norm forward dst offset (match rms_norm_back)
Nvidia GPU
ggml
#24214 by
palios-taey
was closed 2026-06-06 01:00
completion : fix format specifier in LOG_INF
examples
#24213 by
angt
was merged 2026-06-06 09:24
server : clamp n_discard to the available context window
examples
server
#24212 by
palios-taey
was closed 2026-06-06 01:07
model : rename local n_layer_all variable
merge ready
#24209 by
CISC
was merged 2026-06-06 04:07
context : fix off-by-one comparisons to n_gpu_layers
merge ready
#24208 by
CISC
was merged 2026-06-06 04:06
Sk/opencl-0606
documentation
script
testing
examples
python
ggml
Apple Metal
OpenCL
#24207 by
HDSulfox
was closed 2026-06-05 19:30
server: add -pp parameter to force enable/disable pipeline parallelism
#24205 by
dark-penguin
was closed 2026-06-05 20:55
server: add -pp parameter to force enable/disable pipeline parallelism
#24204 by
dark-penguin
was closed 2026-06-05 18:41
Add Paligemma 1 support
model
examples
python
#24200 by
tboinovski1
was closed 2026-06-05 18:02
fix: call n_layer() in model info logging
#24195 by
alecmack
was closed 2026-06-05 16:07
model: fix build failed
#24193 by
ngxson
was merged 2026-06-05 16:12
cli: refresh sampling defaults after model load
examples
#24192 by
he-yufeng
was closed 2026-06-05 15:29
model : fix llama_model::n_gpu_layers()
#24188 by
ggerganov
was merged 2026-06-05 14:11
vulkan: check coopmat2 features before reporting support
Vulkan
ggml
#24186 by
0cc4m
was merged 2026-06-06 07:11
Add experimental ROCmFP4 quantization support
testing
Nvidia GPU
Vulkan
examples
python
ggml
#24184 by
charlie12345
was closed 2026-06-05 13:40
TP: round up granularity to 128
#24180 by
JohannesGaessler
was merged 2026-06-05 15:35
common/chat : unify and fix LFM2/LFM2.5 tool parser
testing
#24178 by
tdakhran
was merged 2026-06-05 19:31
ui: run npm install when package-lock.json is newer than node_modules
script
#24171 by
ServeurpersoCom
was merged 2026-06-05 12:57
Fix link to available UI settings
examples
server
#24169 by
wariuccio
was merged 2026-06-05 12:39
minor : fix lint issues
model
#24165 by
ggerganov
was merged 2026-06-05 08:17
opencl: improve get_rows, cpy, concat and q6_k flat gemv
ggml
OpenCL
#24160 by
lhez
was merged 2026-06-05 20:45
ci : build-msys job slimming [no ci]
devops
#24157 by
danbev
was merged 2026-06-05 05:57
vulkan: Walsh-Hadamard fast path segfaults on AMD iGPU (Radeon 780M) — same pattern as Intel Arc
Vulkan
ggml
#24155 by
e4779
was closed 2026-06-05 04:52
ui: add ignore-scripts=true to npmrc
examples
server/ui
#24149 by
ngxson
was merged 2026-06-05 12:31
mtmd : validate mmproj metadata arrays before consuming
examples
#24142 by
palios-taey
was closed 2026-06-04 21:13
minja : cap range() materialization in chat-template render
jinja parser
#24140 by
palios-taey
was closed 2026-06-04 19:01
ggml-rpc : bound rdma_recv completion length before copy
ggml
#24136 by
palios-taey
was closed 2026-06-04 21:13