llama.cpp
CUDA: Add BF16 path to CUBLAS and increase precision of FP16 path
#20078
Open

CUDA: Add BF16 path to CUBLAS and increase precision of FP16 path #20078

ORippler
ORippler CUDA: Add BF16 CUBLAS path
5296977b
ORippler CUBLAS: Use FP32 accumulation also for VOLTA TCs
9c3032b7
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
JohannesGaessler
JohannesGaessler
ORippler
ORippler
IMbackK
ORippler
IMbackK
IMbackK

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone