llama.cpp
6272b676
- use stride=128 if built for tensor cores
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
use stride=128 if built for tensor cores
References
ceb/perf-faster-multigpu
Author
cebtenzzre
Parents
dd71a35c
Loading