llama.cpp
1ee9d0b4
- CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
21 days ago
CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557) * CUDA: use fastdiv + ggml_cuda_mad for mmvf * use bf16 directly + fix formatting * Add exception for HIP code
References
#16557 - CUDA: use fastdiv + ggml_cuda_mad for mmvf
Author
am17an
Parents
48e2fa9f
Loading