llama.cpp
CUDA: Add BF16 path to CUBLAS and increase precision of FP16 path
#20078
Open
Commits
CUDA: Add BF16 CUBLAS path
ORippler committed 8 days ago
CUBLAS: Use FP32 accumulation also for VOLTA TCs
ORippler committed 8 days ago