llama.cpp
856c3ada - hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993)

Commit
27 days ago
hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993)

* hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase
* hmx-mm: optimize per-group scale handling
* hmx-fa: optimize slope load from vtcm
* hmx-fa: use aligned access where possible in hmx-utils
* hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers

---------

Co-authored-by: Max Krasnyansky <maxk@qti.qualcomm.com>