mmq.cu: tune mmq/rocblas switching for RDNA #18537
Patch perf regression for mmq kernels in ROCm
51e3d385
add n_experts branch like the cdna path
69a9a68b
mmq.cu: tune mmq/wmma switching for RDNA
a435c772
Beinsezii
changed the title from "mmq.cu: tune mmq/wmma switching for RDNA" to "mmq.cu: tune mmq/rocblas switching for RDNA" 26 days ago
IMbackK
approved these changes
on 2026-01-02
mmq.cu: move amd wmma mmq/wmma switching behind IS_RDNA3
3326fa23
Update ggml/src/ggml-cuda/mmq.cu
3fef966d
Beinsezii
deleted the beinsezii/rocm_mmq_tune branch 22 days ago
Assignees
No one assigned