llama.cpp
Vectorize load instructions in dmmv f16 CUDA kernel
#9816
Merged

Vectorize load instructions in dmmv f16 CUDA kernel #9816

agray3
agray3 Vectorize load instructions in dmmv f16 CUDA kernel
95c8b9c1
github-actions github-actions added Nvidia GPU
agray3
JohannesGaessler
JohannesGaessler commented on 2024-10-10
agray3 addressed comment
d07dc44c
agray3
JohannesGaessler
JohannesGaessler commented on 2024-10-10
agray3 Update ggml/src/ggml-cuda/dmmv.cu
d150c7e3
JohannesGaessler
JohannesGaessler approved these changes on 2024-10-10
slaren slaren merged 13dca2a5 into master 1 year ago
JohannesGaessler

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone