llama.cpp
hexagon: eliminate scalar VTCM loads via HVX splat helpers
#22993
Merged

Commits
  • hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase
    max-krasnyansky committed 37 days ago
  • hmx-mm: optimize per-group scale handling
    max-krasnyansky committed 37 days ago
  • hmx-fa: optimize slope load from vtcm
    max-krasnyansky committed 37 days ago
  • hmx-fa: use aligned access where possible in hmx-utils
    max-krasnyansky committed 37 days ago
  • hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers
    trivikram-reddy1 committed 36 days ago
Loading