ngxson/llama.cpp
Pull Requests
server: support OAI /v1/audio/transcriptions API (for AI review)
examples
server
#99 by
ngxson
was closed 2026-04-14 09:26
[Mirror] https://github.com/ggml-org/llama.cpp/pull/21711
examples
#97 by
ngxson
was closed 2026-04-12 21:59
[Mirror] feat: jinja engine improvements for reka-edge
testing
jinja parser
#96 by
ngxson
was closed 2026-04-09 10:17
Xsn/mistral small 4 (for CI)
python
testing
#92 by
ngxson
was closed 2026-03-16 23:31
convert_hf_to_gguf: fix flake8 E301 in Mistral4Model
python
#91 by
eauchs
was closed 2026-03-16 22:02
Xsn/mtmd debug (for CI)
examples
#90 by
ngxson
was closed 2026-03-13 21:22
[Mirror] llama: dynamic head_dim and n_rot for SWA
python
model
#89 by
ngxson
was closed 2026-03-09 21:22
[Mirror] Parse port numbers from MCP server URLs in CORS proxy
examples
python
server
#88 by
ngxson
was closed 2026-03-09 16:47
[Mirror] server : fix router child env in containerized environments
examples
server
#75 by
ngxson
was closed 2026-04-20 23:07
Jinja tests and `lstrip_block` fixes
testing
#73 by
aldehir
was merged 2026-01-04 17:40
jinja whitespace tests
testing
#72 by
aldehir
was merged 2026-01-04 13:17
fix(cuda): prevent UVM driver deadlock on SIGINT
examples
server
#71 by
dnhkng
was closed 2025-12-30 17:15
mimo2: wire RMS eps + MoE bias + converter guards
python
model
#69 by
ngxson
was merged 2025-12-24 11:27
[Mirror] Support Youtu-VL Model
examples
python
#67 by
ngxson
was closed 2025-12-25 09:28
Rebased ASR for LFM2-Audio-1.5B
examples
ggml
python
Nvidia GPU
testing
#58 by
tdakhran
was closed 2025-12-15 13:23
Xsn/cli arrow left right
#53 by
ServeurpersoCom
was merged 2025-12-07 12:29
WebUI updates for llama-server model management feature
examples
ggml
python
server
SYCL
Nvidia GPU
Vulkan
testing
devops
model
OpenCL
#43 by
allozaur
was merged 2025-11-29 22:55
Allozaur/server model management v1 2
examples
server
#42 by
allozaur
was merged 2025-11-24 10:48
Allozaur/server model management v1 2
examples
server
#41 by
ngxson
was merged 2025-11-22 17:32
PoC router/proxy server
examples
server
testing
#34 by
ngxson
was closed 2025-11-15 22:27
FOR CI | common: move download functions to download.(cpp|h)
#33 by
ngxson
was closed 2025-11-15 19:58
Mrope fix
examples
#31 by
ngxson
was merged 2025-10-28 14:56
ggml : refactor forward_dup for cpu backend
ggml
#30 by
xuanson2025
was merged 2025-09-18 02:05
Implement pixel unshuffle block for lfm2vl
examples
#29 by
tdakhran
was closed 2025-08-21 16:39
ggml_scale_bias
#27 by
ngxson
was closed 2025-07-10 11:01
Hunyuan tokenizer
python
#26 by
ngxson
was merged 2025-06-30 15:40
WIP Pixtral
examples
python
#22 by
ngxson
was closed 2025-04-22 14:23
clip : use smart pointers
examples
#21 by
ngxson
was closed 2025-04-10 09:07
Llama-4 mapping
python
#20 by
danielhanchen
was merged 2025-04-10 09:37
llama4 WIP
python
#19 by
ngxson
was closed 2025-04-07 07:53