CUDA/HIP: refactor mmvq to unify the calculation of nwarps and rows per block between host and device code. #12177
refactor mmvq to unify the calculation of nwarps and rows per block …
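For readers skimming the title, here is a minimal sketch of the general pattern it describes: one `constexpr` helper, marked for both host and device, that the launch code and the kernel both call so the two sides cannot disagree. Everything in it is an assumption for illustration: the `example_*` names, the `EXAMPLE_HOST_DEVICE` macro, and the numeric thresholds are hypothetical and are not taken from `mmvq.cu`.

```cpp
// Expands to the CUDA/HIP qualifiers under nvcc/hipcc and to nothing under a
// plain C++ compiler, so the same helpers build in both worlds.
#if defined(__CUDACC__) || defined(__HIPCC__)
#define EXAMPLE_HOST_DEVICE __host__ __device__
#else
#define EXAMPLE_HOST_DEVICE
#endif

constexpr int EXAMPLE_WARP_SIZE = 32; // assumed warp size for this sketch

// Hypothetical rule: use fewer warps as more dst columns are handled per block.
static constexpr EXAMPLE_HOST_DEVICE int example_nwarps(int ncols_dst) {
    return ncols_dst <= 4 ? 4 : (ncols_dst <= 8 ? 2 : 1);
}

// Hypothetical rule: one row per block for a single column, two otherwise.
static constexpr EXAMPLE_HOST_DEVICE int example_rows_per_block(int ncols_dst) {
    return ncols_dst == 1 ? 1 : 2;
}

struct example_launch_dims {
    int rows_per_block;
    int threads_per_block;
    int num_blocks;
};

// Host side: derive the launch configuration from the shared helpers.
static inline example_launch_dims example_launch(int nrows, int ncols_dst) {
    const int rpb = example_rows_per_block(ncols_dst);
    return { rpb, example_nwarps(ncols_dst) * EXAMPLE_WARP_SIZE, (nrows + rpb - 1) / rpb };
}

#if defined(__CUDACC__) || defined(__HIPCC__)
// Device side: the kernel evaluates the same helpers at compile time instead
// of repeating the rules, so it always matches what the host launched.
template <int ncols_dst>
__global__ void example_kernel(int * out) {
    constexpr int nwarps         = example_nwarps(ncols_dst);
    constexpr int rows_per_block = example_rows_per_block(ncols_dst);
    if (threadIdx.x == 0 && blockIdx.x == 0) {
        out[0] = nwarps;
        out[1] = rows_per_block;
    }
}
#endif

// Compile-time checks of the hypothetical rules above.
static_assert(example_nwarps(1) == 4, "single column uses 4 warps in this sketch");
static_assert(example_rows_per_block(2) == 2, "multi-column uses 2 rows per block in this sketch");
```

With this layout, changing a threshold in one helper changes both the host launch dimensions and the kernel's compile-time constants together, which appears to be the kind of duplication the title aims to remove.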
888ffc87
IMbackK force pushed to 888ffc87 320 days ago
make CUDA happy, as it doesn't support calling host constexpr function…
a55d765d
Fix nits
b85a723a
Fix spelling of parameter
15f4dcaf
Update ggml/src/ggml-cuda/mmvq.cu
1b3894eb
IMbackK merged 10f2e818 into master 313 days ago
Assignees
No one assigned