llama.cpp
hexagon: eliminate scalar VTCM loads via HVX splat helpers
#22993

Merged

Commits

hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase

max-krasnyansky committed 37 days ago
hmx-mm: optimize per-group scale handling

max-krasnyansky committed 37 days ago
hmx-fa: optimize slope load from vtcm

max-krasnyansky committed 37 days ago
hmx-fa: use aligned access where possible in hmx-utils

max-krasnyansky committed 37 days ago
hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers

trivikram-reddy1 committed 36 days ago