llama.cpp
e1e8e099 - CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199)

Commit
261 days ago