llama.cpp
CUDA: Quantized matrix matrix multiplication
#2160
Merged


JohannesGaessler
JohannesGaessler changed the title from "Cuda matrix matrix 6" to "CUDA: Quantized matrix matrix multiplication" 2 years ago
slaren
JohannesGaessler
cmp-nct
JohannesGaessler force pushed from 0f2a62c6 to 60df883a 2 years ago
JohannesGaessler
JohannesGaessler force pushed from 60df883a to 31f229c7 2 years ago
JohannesGaessler
abc-nix
JohannesGaessler
cmp-nct
JohannesGaessler
JohannesGaessler force pushed from d8e26973 to a3b096b4 2 years ago
abc-nix
JohannesGaessler force pushed from 84f787eb to cf0a5051 2 years ago
JohannesGaessler
slaren
ggerganov
cmp-nct
JohannesGaessler
cmp-nct
JohannesGaessler
cmp-nct
JohannesGaessler
cmp-nct
JohannesGaessler force pushed from cf0a5051 to 5fa10641 2 years ago
JohannesGaessler
JohannesGaessler mmq implementation for non k-quants
ddb37bf8
JohannesGaessler q6_K
4b3af63e
JohannesGaessler q2_K
5bff3df0
JohannesGaessler q3_k
a62bcc89
JohannesGaessler q4_K
b59cd1dc
JohannesGaessler vdr
5d8b3de4
JohannesGaessler q5_K
b53e7138
JohannesGaessler force pushed from 5fa10641 to b53e7138 2 years ago
JohannesGaessler
JohannesGaessler faster q8_1 loading
a3505fac
JohannesGaessler loop unrolling
6808800c
JohannesGaessler add __restrict__
58daf95a
JohannesGaessler q2_K sc_high
abed4463
JohannesGaessler GGML_CUDA_MMQ_Y
3c09e11c
JohannesGaessler Updated Makefile
038ed631
JohannesGaessler Update Makefile
495c8981
JohannesGaessler DMMV_F16 -> F16
656c1ab3
JohannesGaessler
JohannesGaessler Updated README, CMakeLists
aa4b2c93
JohannesGaessler marked this pull request as ready for review 2 years ago
JohannesGaessler
slaren
slaren commented on 2023-07-29
JohannesGaessler Fix CMakeLists.txt
c0dfd5a5
JohannesGaessler Fix CMakeLists.txt
0b5f9891
JohannesGaessler Fix multi GPU out-of-bounds
0bb22bb4
slaren
slaren approved these changes on 2023-07-29
JohannesGaessler merged 11f3ca06 into master 2 years ago
mirek190
JohannesGaessler
mirek190
JohannesGaessler
Green-Sky
JohannesGaessler
Dampfinchen
Green-Sky
Dampfinchen
Dampfinchen
Dampfinchen
LostRuins
JohannesGaessler
nauful
dranger003
LostRuins
LostRuins
dranger003
