ggerganov/llama.cpp
Pull Requests
Stub out bc6h support
examples
ggml
#16419 by monocasa was closed 2025-10-04 02:34
vulkan: use a more appropriate amount of threads when generating shaders
Vulkan
ggml
#16418 by netrunnereve was merged 2025-10-04 20:04
ci : refactor sdk caching to minimize storage
devops
#16414 by CISC was merged 2025-10-06 15:40
Magistral
testing
#16413 by ServeurpersoCom was merged 2025-10-03 18:51
metal : fix loop bound in ggml_mem_ranges
ggml
Apple Metal
#16412 by ggerganov was merged 2025-10-03 16:18
llama : fix shapes for bert/mpt q/k norm
#16409 by CISC was merged 2025-10-03 12:40
Fix missing messages on sibling navigation
examples
server
#16408 by allozaur was merged 2025-10-03 10:51
Capture model name only after first token (streaming) or completed request
examples
server
#16405 by allozaur was merged 2025-10-03 09:30
Fix messages payload sent to chat completions
server/webui
examples
bugfix
server
#16402 by allozaur was merged 2025-10-03 07:11
ci : change macos-13 to macos-15-intel
devops
#16401 by danbev was merged 2025-10-03 09:45
ggml webgpu: actually add softmax, fix rms_norm offset
ggml
#16400 by reeselevine was merged 2025-10-05 03:59
ggml : fix graph reallocation with multiple chunks
testing
ggml
#16396 by Acly was merged 2025-10-03 11:49
refactor: centralize CoT parsing in backend for streaming mode
testing
examples
server
#16394 by ServeurpersoCom was merged 2025-10-08 20:18
server : host-memory prompt caching
examples
python
server
#16391 by ggerganov was merged 2025-10-09 15:54
test-barrier : do not use more threads than physically available
testing
#16389 by CISC was merged 2025-10-02 18:10
ci : attempt to fix ubuntu-latest-cmake-rpc
devops
#16388 by CISC was merged 2025-10-02 11:51
model-conversion : add support for SentenceTransformers
examples
python
#16387 by danbev was merged 2025-10-09 12:35
implement context checkpointing for hybrid and recurrent models
examples
server
#16382 by ddh0 was merged 2025-10-03 18:34
ci : fix clean-up of old logs
devops
#16381 by ggerganov was merged 2025-10-02 07:35
tests : add -INF blocks to the KQ mask in the FA tests
testing
ggml
#16380 by ggerganov was merged 2025-10-07 05:22
common : fix Hermes/Qwen tool-call parsing to stop wrapper leaks
testing
#16378 by laundrevity was closed 2025-10-02 19:02
common : fix Hermes tool-call parser leaking wrapper XML
testing
#16377 by laundrevity was closed 2025-10-02 00:04
CI: reenable cdna in rocm docker builds
devops
#16376 by IMbackK was merged 2025-10-01 21:32
HIP: add myself as codeowner
#16375 by IMbackK was merged 2025-10-02 03:52
common: introduce http.h for httplib-based client
examples
#16373 by angt was merged 2025-10-01 17:22
metal : mark FA blocks
testing
ggml
Apple Metal
#16372 by ggerganov was merged 2025-10-08 07:57
[SYCL]Update to oneAPI 2025.2
documentation
devops
SYCL
#16371 by NeoZhangJianyu was merged 2025-10-02 07:16
Conversation action dialogs as singletons from Chat Sidebar + apply conditional rendering for Actions Dropdown for Chat Conversation Items
server/webui
examples
bugfix
server
#16369 by allozaur was merged 2025-10-01 16:18
Fix Tether CI CMake cmake pkg
#16368 by jesusmb1995 was closed 2025-10-01 11:28
model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules
python
#16367 by sfallah was merged 2025-10-09 06:39