Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
ggerganov/llama.cpp
Pull Requests
Commits
Open
Closed
llama: add canaries to Markdown files
#18735 opened 2026-01-10 11:03 by
JohannesGaessler
feat: add support for WeDLM architecture
python
#18731 opened 2026-01-10 02:07 by
feedseawave
lookup, lookahead: fix crash when n_ctx not specified
examples
#18729 opened 2026-01-10 00:09 by
pestopoppa
llama: fix pooled embedding readback sizing/stride and state I/O
#18723 opened 2026-01-09 18:43 by
retr0reg
model: Add VAETKI support
model
examples
python
#18719 opened 2026-01-09 14:42 by
dororodoroddo
ggml: new backend for Virglrenderer API Remoting acceleration (v2)
build
python
ggml
#18718 opened 2026-01-09 13:29 by
kpouget
Support parsing JSON into grammar for schemas with no type and no properties
#18711 opened 2026-01-09 07:37 by
markrietveld
[WIP] ggml-opencl: op args init refactoring
python
ggml
OpenCL
#18701 opened 2026-01-08 16:49 by
chraac
Improving inference speed for the repack buffer type on NUMA architectures
examples
ggml
#18698 opened 2026-01-08 15:01 by
zzjianhui
ggml-cuda: extend concat support for more types
Nvidia GPU
ggml
#18690 opened 2026-01-08 07:36 by
Lourdle
Autoparser - complete refactoring of parser architecture
documentation
model
script
testing
examples
python
server
#18675 opened 2026-01-07 18:45 by
pwilkin
server: support image+text input for embeddings (Qwen3-VL-Embedding)
examples
server
#18665 opened 2026-01-07 12:56 by
ngxson
MCP MVP
enhancement
server/webui
examples
server
#18655 opened 2026-01-07 08:32 by
allozaur
docs: update ops.md for CANN backend
documentation
#18654 opened 2026-01-07 08:22 by
hipudding
CANN: support gated linear attn
ggml
Ascend NPU
#18653 opened 2026-01-07 02:55 by
hipudding
common: use httplib + boringssl by default
build
devops
#18648 opened 2026-01-06 20:30 by
ngxson
[Do Not Merge] model : LFM2.5-Audio-1.5B
model
examples
python
server
#18641 opened 2026-01-06 14:25 by
tdakhran
Remove annoying warnings (unused functions)
#18639 opened 2026-01-06 10:04 by
Nekotekina
alloc : skip unassigned leafs
ggml
#18636 opened 2026-01-06 09:26 by
ggerganov
Added note for compiling on integrated GPUs
documentation
#18633 opened 2026-01-06 04:58 by
alosslessdev
rpc : implement event and async backend APIs
ggml
#18626 opened 2026-01-05 15:11 by
rgerganov
CANN: Remove unused `ggml_cann_get_device` function
ggml
Ascend NPU
#18625 opened 2026-01-05 15:10 by
rauletorresc
sampling: add tail-free (TFS) sampling
#18612 opened 2026-01-05 06:45 by
viralvgupta
Fix grammar parsing issues to prevent stack overflow and hangs
testing
#18604 opened 2026-01-05 02:16 by
aagit
common: build as shared library when BUILD_SHARED_LIBS is ON
#18602 opened 2026-01-05 00:11 by
rsauciuc
memory : add llama_memory_hybrid_iswa
#18601 opened 2026-01-04 23:23 by
tdakhran
mtmd : fix integer overflow when n_tokens equals INT32_MIN
examples
#18588 opened 2026-01-04 09:21 by
ylwango613
Fix division by zero vulnerability in gguf_init_from_file_impl
ggml
#18586 opened 2026-01-04 08:34 by
ylwango613
gguf-hash: add RVV tensor hashing using xxh3
examples
#18576 opened 2026-01-04 02:11 by
ixgbe
GGML RPC - Add support for Unix Domain Sockets
examples
ggml
#18574 opened 2026-01-03 22:44 by
struct
Newer
Older