llama.cpp
d9df1100 - HIP: use hipBLAS for dense prefill on gfx900, keep MMQ for MoE (#24588)

Commit
3 days ago
HIP: use hipBLAS for dense prefill on gfx900, keep MMQ for MoE (#24588)

* HIP: keep MMQ for gfx900 MoE and Q8_0, use hipBLAS for dense K-quants

Assisted-by: GitHub Copilot CLI

* HIP: tighten conditional block to be explicitly for gfx900

* HIP: Further simplified gfx900 conditional block

* removed unnecessary comment
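
The selection rule described in the commit message can be pictured with a minimal sketch. This is not the code from the commit: `QuantKind`, `MatmulPath`, and `choose_path_gfx900_sketch` are hypothetical names made up for illustration, and the `is_prefill` flag stands in for whatever batch-size condition the real code uses. Cases the message does not mention are reported as `Unchanged` rather than guessed.

```cpp
// Hypothetical quantization categories; only the ones named in the
// commit message are distinguished here.
enum class QuantKind { Q8_0, KQuant, Other };

// Hypothetical outcome of the selection. Unchanged means "the commit
// message says nothing about this case, so pre-existing logic applies".
enum class MatmulPath { MMQ, HipBLAS, Unchanged };

// Illustrative sketch only (assumed signature, not llama.cpp API).
constexpr MatmulPath choose_path_gfx900_sketch(bool is_gfx900,
                                               bool is_moe,
                                               bool is_prefill,
                                               QuantKind kind) {
    // The rule is explicitly scoped to gfx900; other GPUs are untouched.
    if (!is_gfx900) {
        return MatmulPath::Unchanged;
    }
    // MoE models keep the MMQ kernels.
    if (is_moe) {
        return MatmulPath::MMQ;
    }
    // Q8_0 keeps the MMQ kernels as well.
    if (kind == QuantKind::Q8_0) {
        return MatmulPath::MMQ;
    }
    // Dense K-quant prefill is routed to hipBLAS.
    if (is_prefill && kind == QuantKind::KQuant) {
        return MatmulPath::HipBLAS;
    }
    // Anything else is not covered by the commit message.
    return MatmulPath::Unchanged;
}
```

Under these assumptions, a dense K-quant model doing prefill on gfx900 would take the hipBLAS path, while the same GPU running a MoE model or Q8_0 weights would stay on MMQ.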