llama.cpp
a3c28439 - cuda : fine-tune >= VOLTA params + use MMQ only for small batches

Commit
2 years ago
cuda : fine-tune >= VOLTA params + use MMQ only for small batches
Author
Parents
Loading