llama.cpp
llama : rotate activations for better quantization
#21038
Merged

llama : rotate activations for better quantization #21038

ggerganov merged 12 commits into master from gg/attn-rot
ggerganov
ubergarm
AesSedai
Dampfinchen
ggerganov
Rotatingxenomorph
AesSedai
Dampfinchen
ggerganov ggerganov force pushed from 7711b3a3 to 5e60035f 75 days ago
ggerganov ggerganov marked this pull request as ready for review 75 days ago
ggerganov ggerganov requested a review from CISC CISC 75 days ago
ggerganov
CISC
CISC approved these changes on 2026-03-28
am17an
handpickencounter
CISC
am17an
Rotatingxenomorph
CISC
Dampfinchen
CISC
am17an
CISC
ggerganov ggerganov force pushed from 5e60035f to e05a5045 74 days ago
ggerganov
Dampfinchen
am17an
ggerganov
am17an
ggerganov
erazortt
ryrAiy
pwilkin
mirek190
Meltedd
erazortt
segmond
pwilkin
strawberrymelonpanda
pheonix-delta
dagbdagb
pheonix-delta
erazortt
ggerganov ggerganov force pushed from e05a5045 to c35f75d0 72 days ago
ggerganov
pwilkin
pwilkin approved these changes on 2026-03-31
ggerganov llama : rotate activations for better quantization
4d68f97e
ggerganov cont : rotate V more + refactor
cb6e21d8
ggerganov cont : rotate caches separately + support non-power-of-2 head sizes
d467d3d0
ggerganov cont : simplify
898a8fe6
ggerganov cont : add reference for V rotation
eaefe0f0
ggerganov cont : refactor
66b72b41
ggerganov cont : support context shift
62adb6a1
ggerganov cont : consolidate
d424a885
ggerganov cont : dedup + allow different types for the rotation matrix
69e476d3
ggerganov cont : add env variable to disable rotation
29f41968
ggerganov cont : simplify attn rot kv cache logic + rename env
8df24c07
ggerganov cont : pre-compute the Hadamard matrices
a0c4a2a7
ggerganov ggerganov force pushed from c35f75d0 to a0c4a2a7 72 days ago
ggerganov ggerganov merged 744c0c73 into master 71 days ago
ggerganov ggerganov deleted the gg/attn-rot branch 71 days ago
nawoa
Rotatingxenomorph
EAddario
pwilkin
nawoa
pwilkin
JeroenAdam
CISC
erazortt
pwilkin
Dampfinchen
vektorprime

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone