hexagon: Q4_0 and MXFP4 repack fixes #20527
hexagon: fix tail corruption with rows sizes not multiple of 256
4c33e4dd
hexagon: use different stride for repacking partial blocks
40fab2c2
hex-mm: update repack and kernels to avoid shuffles for full 256-elem…
47f5b864
hex-mm: update rmpy x8 for better optimizations
e445dbf1
hex-mm: tighten supported MUL_MAT checks to avoid spurios failures
27243fc3
hex-mm: use vzero to init accumulators
a769f7c2
lhez
approved these changes
on 2026-03-14
hex-mm: properly call partial rmpy_x8
efea8560
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub