llama.cpp
e1e8e099
- CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
163 days ago
CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199)
References
#13199 - CUDA: batched+noncont MMQ, refactor bs>1 MoE code
Author
JohannesGaessler
Parents
6f67cf1f
Loading