llama.cpp
ggml-cuda : perform cublas fp16 matrix multiplication as fp16 #3370
Merged

ggerganov merged 3 commits into master from cublas-f16
slaren ggml-cuda : perform cublas fp16 matrix multiplication as fp16 (79fe5a1f; see the sketch below the timeline)
slaren try to fix rocm build (32ada53c)
cebtenzzre commented on 2023-09-27
slaren restrict fp16 mat mul to volta and up (7d5674dd)
ggerganov approved these changes on 2023-09-28
ggerganov merged da040034 into master 1 year ago
slaren deleted the cublas-f16 branch 1 year ago
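
For context, the change switches the cuBLAS matrix multiplication path to keep fp16 operands in fp16 rather than upconverting them to fp32 first, and (per the third commit) enables this only on Volta and newer GPUs. Below is a minimal, self-contained sketch of that idea using cublasGemmEx with CUDA_R_16F operands and fp16 compute; the helper names, error-check macros, and standalone setup are illustrative assumptions, not the actual ggml-cuda code. On ROCm builds, ggml routes such cuBLAS calls through hipBLAS equivalents, which is presumably what the "try to fix rocm build" commit addresses.

```cpp
// Sketch: fp16 GEMM via cuBLAS, gated on Volta (sm_70) and newer.
// Assumes CUDA 11+ for cublasComputeType_t. Not the actual ggml-cuda code.
#include <cublas_v2.h>
#include <cuda_fp16.h>
#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>

#define CUDA_CHECK(call) do { cudaError_t e = (call); if (e != cudaSuccess) { \
    fprintf(stderr, "CUDA error %d at line %d\n", (int) e, __LINE__); exit(1); } } while (0)
#define CUBLAS_CHECK(call) do { cublasStatus_t s = (call); if (s != CUBLAS_STATUS_SUCCESS) { \
    fprintf(stderr, "cuBLAS error %d at line %d\n", (int) s, __LINE__); exit(1); } } while (0)

// C = A^T * B with fp16 inputs, fp16 output, and fp16 compute.
// The key point of the PR title: the accumulation itself stays in
// half precision instead of being performed in fp32.
static void mul_mat_f16(cublasHandle_t handle,
                        const half * A, const half * B, half * C,
                        int m, int n, int k) {
    const half alpha = __float2half(1.0f);
    const half beta  = __float2half(0.0f);
    CUBLAS_CHECK(cublasGemmEx(handle, CUBLAS_OP_T, CUBLAS_OP_N,
                              m, n, k,
                              &alpha, A, CUDA_R_16F, k,   // A is k x m, transposed
                                      B, CUDA_R_16F, k,   // B is k x n
                              &beta,  C, CUDA_R_16F, m,   // C is m x n
                              CUBLAS_COMPUTE_16F,
                              CUBLAS_GEMM_DEFAULT_TENSOR_OP));
}

int main() {
    // Restrict the fp16 path to compute capability 7.0 (Volta) and up,
    // as the third commit does; older GPUs lack fp16 tensor cores and
    // may run fp16 math slower than fp32.
    cudaDeviceProp prop;
    CUDA_CHECK(cudaGetDeviceProperties(&prop, 0));
    if (prop.major < 7) {
        printf("pre-Volta GPU: would fall back to the fp32 cuBLAS path\n");
        return 0;
    }

    const int m = 64, n = 64, k = 64;
    half *A, *B, *C;
    CUDA_CHECK(cudaMalloc(&A, (size_t) m * k * sizeof(half)));
    CUDA_CHECK(cudaMalloc(&B, (size_t) k * n * sizeof(half)));
    CUDA_CHECK(cudaMalloc(&C, (size_t) m * n * sizeof(half)));
    CUDA_CHECK(cudaMemset(A, 0, (size_t) m * k * sizeof(half)));
    CUDA_CHECK(cudaMemset(B, 0, (size_t) k * n * sizeof(half)));

    cublasHandle_t handle;
    CUBLAS_CHECK(cublasCreate(&handle));
    mul_mat_f16(handle, A, B, C, m, n, k);
    CUDA_CHECK(cudaDeviceSynchronize());

    CUBLAS_CHECK(cublasDestroy(handle));
    CUDA_CHECK(cudaFree(A));
    CUDA_CHECK(cudaFree(B));
    CUDA_CHECK(cudaFree(C));
    return 0;
}
```

The trade-off is speed versus accuracy: an fp16 GEMM halves the operands' memory traffic and can use tensor cores on Volta and later, but accumulating in half precision loses accuracy relative to fp32 compute, which is one reason the fast path is gated to hardware where fp16 is actually faster.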