CUDA: fuse rope + set_rows #16884
am17an
commented
on 2025-10-31
am17an
commented
on 2025-10-31
am17an
force pushed
47 days ago
CUDA: add fused rope
acfd03d3
move k forward_expand up
b3761df3
create helper function instead of re-using params
67b6580b
make assert statement more in line with comment
c7c3b9f1
rope_norm: coalesced writes to global mem
67624935
am17an
force pushed
to
67624935
35 days ago
am17an
merged
a90eb94c
into master 34 days ago
am17an
deleted the cuda-add-rope-fusion branch 34 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub