llama.cpp
cuda : update supports_op for matrix multiplication
#8245
Merged

cuda : update supports_op for matrix multiplication #8245

ggerganov merged 1 commit into master from sl/fix-cuda-supports
slaren
slaren1 year ago👍 2

Update supports_op to correctly reflect that bf16 is not supported, and prevent new types added in the future from being incorrectly reported as supported. This will also cause bf16 models to be run on the CPU rather than crashing when using a CUDA build.

slaren cuda : update supports_op for matrix multiplication
b1b3b00f
github-actions github-actions added testing
ggerganov
ggerganov approved these changes on 2024-07-02
ggerganov ggerganov merged 0e0590ad into master 1 year ago
slaren slaren deleted the sl/fix-cuda-supports branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone