llama.cpp
CUDA: mul_mat_vec_q tiling, refactor mul mat logic
#5434
Merged

CUDA: mul_mat_vec_q tiling, refactor mul mat logic #5434

JohannesGaessler
JohannesGaessler CUDA: mul_mat_vec_q tiling, refactor mul mat logic
97f8a7a2
cebtenzzre
slaren
Artefact2
slaren
JohannesGaessler fix AMD
2bb97fca
ggerganov
Artefact2
JohannesGaessler
JohannesGaessler revert low register pressure changes
76a0128b
JohannesGaessler
ggerganov
JohannesGaessler
JohannesGaessler
JohannesGaessler
ggerganov
JohannesGaessler
JohannesGaessler
ggerganov
slaren
ggerganov
JohannesGaessler
JohannesGaessler
slaren
ggerganov
ggerganov approved these changes on 2024-02-11
ggerganov ggerganov requested a review from slaren slaren 2 years ago
slaren
slaren commented on 2024-02-11
JohannesGaessler refactor fp16 logic, only consider used devices
005de593
slaren
slaren commented on 2024-02-11
JohannesGaessler refactor boolean logic
b1f6fab6
JohannesGaessler any_pascal fixup
a3a46580
JohannesGaessler Update ggml-cuda.cu
763083e5
slaren
slaren approved these changes on 2024-02-11
JohannesGaessler JohannesGaessler merged 3bdc4cd0 into master 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone