Merged

Avoid fp32->fp16->fp32 conversion on cdna in ggml_cuda_op_mul_mat_cublas #11356

JohannesGaessler merged 1 commit into ggml-org:master from IMbackK:cdna_opt

IMbackK requested a review from JohannesGaessler 1 year ago

github-actions added Nvidia GPU

github-actions added ggml

JohannesGaessler approved these changes on 2025-01-24

Avoid fp32->fp16->fp32 conversion on cdna in ggml_cuda_op_mul_mat_cublas

9d9ac6aa

IMbackK force pushed to 9d9ac6aa 1 year ago

JohannesGaessler merged 9fbadaef into master 1 year ago

Reviewers

JohannesGaessler

Labels

Nvidia GPU ggml
