llama.cpp
CUDA: Add BF16 path to CUBLAS and increase precision of FP16 path
#20078
Open

Commits
  • CUDA: Add BF16 CUBLAS path
    ORippler committed 8 days ago
  • CUBLAS: Use FP32 accumulation also for VOLTA TCs
    ORippler committed 8 days ago