CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows per block between host and device code. #12177
refractor mmqv to unify the calculation of nwarps and rows per block …
888ffc87
IMbackK
force pushed
from
50d4277c
to
888ffc87
197 days ago
make cuda happy, as it dosent support calling host constexpr function…
a55d765d
Fix nits
b85a723a
Fix spelling of parameter
15f4dcaf
Update ggml/src/ggml-cuda/mmvq.cu
1b3894eb
IMbackK
merged
10f2e818
into master 189 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub