llama.cpp
d4156690
- cuda : add ROCm / hipBLAS cublasGemmBatchedEx define
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
cuda : add ROCm / hipBLAS cublasGemmBatchedEx define
References
#3749 - cuda : add batched cuBLAS GEMM for faster attention
Author
ggerganov
Parents
878aa4f2
Loading