ggml-hexagon: gelu optimization #18151
joeldushouyu changed the title from "Hexagon gelu optimization" to "ggml-hexagon: gelu optimization" 23 days ago
chraac commented on 2025-12-18
feat: working gelu with src0 put on vtcm
59c88691
feat: gelu ping-pong for both in and out
423a7f80
fix: fix compile error
1c5ffc4b
break: distinguish dma ddr->vtcm and vtcm->ddr operation
22161873
fix: fix dma queue size
3a607f37
break: update dma api to either pop src or dst ptr
29a46c32
fix: fix activation vtcm allocation issue for src1 when swapped
4032576e
refactor: ping-pong gelu logic to avoid unnecessary if else
e9f13ba5
dma: improved queue interface and prefetch handling
dccbcb2c
joeldushouyu force pushed from f590092f to dccbcb2c 18 days ago
joeldushouyu marked this pull request as ready for review 18 days ago
gelu: fix N+2 block prefetch
4ad1507d