llama.cpp
10f2e818 - CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block between host and device code. (#12177)

Commit
180 days ago
CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block between host and device code. (#12177) refactor mmqv to unify the calculation of nwarps and rows per block between host and device code. --------- Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
Author
Parents
Loading