cuda: optimize iq2xxs/iq2xs/iq3xxs dequantization #19624
cuda: optimize iq2xxs/iq2xs/iq3xxs dequantization
be3a90c9
dfriehs
force pushed
from
da09e4ff
to
be3a90c9
5 days ago
dfriehs
changed the title cuda: optimize iq2xxs dequantization cuda: optimize iq2xxs/iq2xs/iq3xxs dequantization 5 days ago
cuda: iq2xxs: simplify sum scaling
dfc0e2c9
am17an
commented
on 2026-02-15
uint -> uint32_t
b82a9807
am17an
approved these changes
on 2026-02-15
am17an
merged
27b93cbd
into master 4 days ago
dfriehs
deleted the iq2xxs-cuda branch 4 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub