ggml-cuda : perform cublas fp16 matrix multiplication as fp16 #3370
ggml-cuda : perform cublas fp16 matrix multiplication as fp16
79fe5a1f
try to fix rocm build
32ada53c
restrict fp16 mat mul to volta and up
7d5674dd
ggerganov
approved these changes
on 2023-09-28
ggerganov
merged
da040034
into master 1 year ago
slaren
deleted the cublas-f16 branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub