llama.cpp
1d72c841
- CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
64 days ago
CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131) * CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16
References
#15131 - CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16
Author
JohannesGaessler
Parents
20638e4f
Loading