Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
ngxson/llama.cpp
Pull Requests
Commits
Open
Closed
Xsn/mtmd debug (for CI)
examples
#90 by
ngxson
was closed 2026-03-13 21:22
[Mirror] llama: dynamic head_dim and n_rot for SWA
python
model
#89 by
ngxson
was closed 2026-03-09 21:22
[Mirror] Parse port numbers from MCP server URLs in CORS proxy
examples
python
server
#88 by
ngxson
was closed 2026-03-09 16:47
Jinja tests and `lstrip_block` fixes
testing
#73 by
aldehir
was merged 2026-01-04 17:40
jinja whitespace tests
testing
#72 by
aldehir
was merged 2026-01-04 13:17
fix(cuda): prevent UVM driver deadlock on SIGINT
examples
server
#71 by
dnhkng
was closed 2025-12-30 17:15
mimo2: wire RMS eps + MoE bias + converter guards
python
model
#69 by
ngxson
was merged 2025-12-24 11:27
[Mirror] Support Youtu-VL Model
examples
python
#67 by
ngxson
was closed 2025-12-25 09:28
Rebased ASR for LFM2-Audio-1.5B
examples
ggml
python
Nvidia GPU
testing
#58 by
tdakhran
was closed 2025-12-15 13:23
Xsn/cli arrow left right
#53 by
ServeurpersoCom
was merged 2025-12-07 12:29
WebUI updates for llama-server model management feature
examples
ggml
python
server
SYCL
Nvidia GPU
Vulkan
testing
devops
model
OpenCL
#43 by
allozaur
was merged 2025-11-29 22:55
Allozaur/server model management v1 2
examples
server
#42 by
allozaur
was merged 2025-11-24 10:48
Allozaur/server model management v1 2
examples
server
#41 by
ngxson
was merged 2025-11-22 17:32
PoC router/proxy server
examples
server
testing
#34 by
ngxson
was closed 2025-11-15 22:27
FOR CI | common: move download functions to download.(cpp|h)
#33 by
ngxson
was closed 2025-11-15 19:58
Mrope fix
examples
#31 by
ngxson
was merged 2025-10-28 14:56
ggml : refactor forward_dup for cpu backend
ggml
#30 by
xuanson2025
was merged 2025-09-18 02:05
Implement pixel unshuffle block for lfm2vl
examples
#29 by
tdakhran
was closed 2025-08-21 16:39
ggml_scale_bias
#27 by
ngxson
was closed 2025-07-10 11:01
Hunyuan tokenizer
python
#26 by
ngxson
was merged 2025-06-30 15:40
WIP Pixtral
examples
python
#22 by
ngxson
was closed 2025-04-22 14:23
clip : use smart pointers
examples
#21 by
ngxson
was closed 2025-04-10 09:07
Llama-4 mapping
python
#20 by
danielhanchen
was merged 2025-04-10 09:37
llama4 WIP
python
#19 by
ngxson
was closed 2025-04-07 07:53
Xsn/mimi dec
examples
python
#17 by
ngxson
was closed 2025-03-30 11:03
server : avoid common_batch
examples
python
server
#16 by
ggerganov
was merged 2025-03-20 16:33
speculative : adapt to new llama API
examples
#15 by
ggerganov
was merged 2025-03-19 08:15
llama : add stdexcept header for std::runtime_error
#13 by
danbev
was closed 2025-02-05 05:01
Tool calls support improvements
documentation
examples
ggml
python
server
Kompute
SYCL
Nvidia GPU
Vulkan
testing
build
devops
script
android
nix
#11 by
mario7421
was closed 2025-08-24 21:29
fix lora issues
#9 by
slaren
was merged 2024-07-10 08:33
Older