ngxson/llama.cpp
Pull Requests
Xsn/chat fix typed content only (for CI)
jinja parser
#87 opened 2026-02-09 20:15 by ngxson
Xsn/better tensor debug print (FOR CI)
model
#86 opened 2026-02-04 15:52 by ngxson
[Mirror] server: /v1/responses (partial)
examples
python
server
#85 opened 2026-01-21 08:34 by ngxson
cli : fix reasoning responses in CLI
examples
server
testing
#84 opened 2026-01-20 15:00 by ngxson
[Mirror] server : refactor oai_parser_opt, move it to server_chat_params
examples
server
#83 opened 2026-01-19 21:46 by ngxson
Glm4moelite
python
#82 opened 2026-01-19 19:05 by ngxson
[Mirror] server: fix memory reservations in populate_token_probs
examples
server
#81 opened 2026-01-19 17:03 by ngxson
[Mirror] server: improve slots scheduling for n_cmpl
examples
python
server
#80 opened 2026-01-12 17:24 by ngxson
Xsn/remote preset
documentation
#79 opened 2026-01-08 14:35 by ngxson
vendor : update cpp-httplib to 0.30.0
examples
python
server
testing
script
#78 opened 2026-01-07 14:56 by ngxson
server: poc audio gen
examples
server
#77 opened 2026-01-07 11:43 by ngxson
Demo: HTTP CORS proxy
examples
server
build
devops
#76 opened 2026-01-06 20:17 by ngxson
[Mirror] server : fix router child env in containerized environments
examples
server
#75 opened 2026-01-05 11:14 by ngxson
Xsn/jinja vm
documentation
examples
python
server
testing
script
#74 opened 2026-01-04 21:53 by ngxson
[Mirror] feat: Add model pinning feature to protect critical models from LRU eviction
examples
server
#70 opened 2025-12-25 19:55 by ngxson
[Mirror] server: (preset) add `unsafe-allow-api-override`
examples
server
#68 opened 2025-12-23 11:08 by ngxson
[Mirror] mtmd: Add DeepSeekOCR Support
examples
ggml
python
Nvidia GPU
testing
model
#66 opened 2025-12-23 00:01 by ngxson
[Mirror] New quantization type: Q3_HIFI
documentation
examples
ggml
python
SYCL
Nvidia GPU
Vulkan
testing
Apple Metal
#65 opened 2025-12-22 23:38 by ngxson
[Mirror] Add Gemma3n multimodal support with MobileNetV5 vision encoder
examples
python
model
#64 opened 2025-12-22 23:17 by ngxson
(FOR CI) Xsn/server data race
examples
server
#63 opened 2025-12-21 22:51 by ngxson
Xsn/server sleep
examples
python
server
#62 opened 2025-12-20 15:22 by ngxson
(FOR CI) Xsn/refactor server preset
examples
server
#61 opened 2025-12-18 20:52 by ngxson
(FOR CI) Xsn/arch refactor llm names
documentation
#60 opened 2025-12-16 11:24 by ngxson
ggml rope v2 demo
examples
ggml
server
testing
model
Apple Metal
#59 opened 2025-12-15 14:52 by ngxson
Xsn/mtmd refactor audio preproc
examples
#57 opened 2025-12-13 13:30 by ngxson
(FOR CI) Xsn/arg neg
examples
server
testing
#56 opened 2025-12-12 19:57 by ngxson
(FOR CI) Xsn/clip refactor smaller files
examples
#55 opened 2025-12-12 15:25 by ngxson
(FOR CI) Xsn/cli server based
examples
server
testing
devops
script
#54 opened 2025-12-10 12:57 by ngxson
(FOR CI) console: allow using arrow left/right to edit the line
#52 opened 2025-12-07 00:14 by ngxson
(FOR CI) Xsn/server improve spec
examples
server
#51 opened 2025-12-06 14:52 by ngxson