llama.cpp
hexagon: eliminate scalar VTCM loads via HVX splat helpers
#22993
Merged


trivikram-reddy1
max-krasnyansky hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm u…
219d281b
max-krasnyansky hmx-mm: optimize per-group scale handling
24485261
max-krasnyansky hmx-fa: optimize slope load from vtcm
ce799b9e
max-krasnyansky hmx-fa: use aligned access where possible in hmx-utils
e6171dd4
trivikram-reddy1 hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers
72ebe045
trivikram-reddy1 requested a review 27 days ago
github-actions added script
github-actions added ggml
github-actions added Hexagon
lhez approved these changes on 2026-05-12
max-krasnyansky approved these changes on 2026-05-13
max-krasnyansky merged 856c3ada into master 27 days ago
trivikram-reddy1 deleted the tr/hvx-splat-vtcm branch 26 days ago
