llama.cpp
CUDA: Add BF16 path to CUBLAS and increase precision of FP16 path
#20078

Open

Commits

CUDA: Add BF16 CUBLAS path

ORippler committed 8 days ago
CUBLAS: Use FP32 accumulation also for VOLTA TCs

ORippler committed 8 days ago

Loading