Pull Requests ggml-org/ggml

tests: Take over test-{alloc,gguf} from llama.cpp

#1551 opened 2026-06-30 18:32 by ckastner

sycl: fix soft_max_f32 max reduction

#1534 opened 2026-06-10 09:43 by someoneinjd

metal : graceful failure instead of GGML_ABORT

#1527 opened 2026-06-03 10:49 by fiorelorenzo

ggml-cpu : fix soft_max_back wrong output when dst aliases src1 (y)

#1521 opened 2026-05-28 19:35 by devYRPauli

ggml : assert b->ne[3] == 1 in ggml_conv_transpose_2d_p0 (#1448)

#1520 opened 2026-05-28 19:17 by devYRPauli

tests : add OLMoE-sized K-quant shapes to test_mul_mat_id (ref #1506)

#1518 opened 2026-05-28 18:31 by devYRPauli

Fix build and test running for Windows MSVC

#1512 opened 2026-05-26 22:16 by MaxwellGengYF

docs(gguf): extend Encoding slot to support percentage-mixed recipes

#1489 opened 2026-05-13 09:43 by mishig25

ggml-cpu/riscv: gate cpu-riscv64 backend on Zv* sub-extensions

#1475 opened 2026-05-02 23:39 by mikey

opencl: add Adreno xmem attention fast path

#1461 opened 2026-04-19 12:19 by happyyzy

Update GGUF description to include full name

#1457 opened 2026-04-02 21:20 by mmontag

fix: gate AVX/AVX-512/AMX on OS XSAVE support via XGETBV

#1435 opened 2026-03-11 09:01 by Mattbusel

ggml-cpu: BMI2 is only available on amd64

#1392 opened 2025-11-26 18:15 by candrews

metal: implement cross-entropy and count-equal ops for MNIST

#1390 opened 2025-11-22 07:02 by xi-guo-0

metal: accelerated conv2d

#1384 opened 2025-11-11 18:38 by bghira

ggml-cpu : Add GGML_CPU_FFAST_MATH for sine autovectorization

#1243 opened 2025-05-28 06:26 by danielzgtg

Temporary fix for rms norm backward on CPU.

#1197 opened 2025-04-22 21:15 by Alkamist

Update example readme/shell output for gpt-2-backend

#1159 opened 2025-03-26 20:59 by rememberlenny

Add an option not to abort on cuda OOM

#1110 opened 2025-02-11 20:43 by WilliamTambellini

ggml : add `ggml_fft` and `ggml_ifft` operator

#1105 opened 2025-02-06 07:36 by ilhamsyahids

ggml-cpu: use lookuptable for ggml op and parallelized some of the memcpy memset etc. calls before ggml_barriers

#1101 opened 2025-02-04 05:20 by Kamayuq

CUDA: update compilation flags for improved performance

#1099 opened 2025-02-03 18:02 by royshil

ggml-cpu: Add bounds checking in `make_block_q4_0x4` function

#1095 opened 2025-02-02 18:50 by ilhamsyahids

GGUF: ggml backend support for writing tensor data

#1033 opened 2024-11-30 10:34 by JohannesGaessler

Add conv_transpose_1d_gemm

#940 opened 2024-08-28 22:30 by smeso

feat: implement backward computation for more operators

#921 opened 2024-08-12 20:11 by Ronsor

feat: adds new OP ggml_unfold_1d

#867 opened 2024-06-21 20:49 by balisujohn

feat: adds `ggml_pad_ext` to allow prefix padding

#864 opened 2024-06-18 08:42 by balisujohn

[experimental]backend: add new oneDNN backend

#855 opened 2024-06-12 15:44 by rfsaliev

Embed yolo files

#831 opened 2024-05-19 11:32 by katsu560