llama.cpp
hexagon: optimize HMX matmul operations
#21071
Open

chraac wants to merge 17 commits into ggml-org:master from chraac:dev-hmx-opt
chraac optimize hmx_mat_mul functions by calculating row and column tiles up…
de56c35d
chraac refactor core_dot_chunk_fp16 to use size_t for tile counts and improv…
b2b21a36
chraac wip
5e18f4ed
chraac set scale outside of loop
a2628321
chraac wip
ee95d92a
chraac refactor core_mma_chunk_fp16 and mat_mul_qk_0_d16a32 to use size_t fo…
33d94316
chraac wip
3a97015f
chraac wip
6e291d8e
chraac refactor transfer_output_chunk_fp16_to_fp32 to use size_t for dimensions
f43d68ca
chraac refactor core_dot_chunk_fp16 to use size_t for tile row stride calcul…
42bd08ca
chraac wip
ee951463
chraac refactor hmx_mat_mul functions to use hvx_vec_splat_f16 for column sc…
91d88a35
chraac requested a review 16 days ago
github-actions added the ggml label
github-actions added the Hexagon label
chraac commented on 2026-03-27
chraac refactor hmx_mat_mul_permuted_w16a32_batched to streamline scale sett…
55d72581
chraac refactor core_dot_chunk_fp16 to improve tile stride calculations for …
362c62ce
chraac Merge branch 'master' into dev-hmx-opt
e31e30a6
max-krasnyansky
chraac
chraac refactor hmx_mat_mul functions to use Q6_V_vsplat_R for column scales…
7c1a5a39
chraac Merge branch 'master' into dev-hmx-opt
3cd80415
max-krasnyansky


Reviewers
No reviews
Assignees
No one assigned