llama.cpp
a90eb94c - CUDA: fuse rope + set_rows (#16884)

Commit
31 days ago
CUDA: fuse rope + set_rows (#16884)

* CUDA: add fused rope
* move k forward_expand up
* create helper function instead of re-using params
* make assert statement more in line with comment
* rope_norm: coalesced writes to global mem
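
For readers unfamiliar with the two operations named in the subject line, the following is a rough CPU-side sketch of what fusing them could mean: the rotary embedding is applied to each source row and the result is written straight into its destination row, with no intermediate buffer for the rotated values. It is not the CUDA kernel from this commit. The function name `rope_norm_set_rows_ref`, its parameters, the adjacent-pair rotation layout, and the `freq_base` frequency formula are assumptions made for illustration, and the sketch does not model the coalesced global-memory writes mentioned in the message.

```cpp
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical CPU reference (illustrative only, not llama.cpp code).
//   src  : n_rows x n_dims, row-major input
//   pos  : token position for each source row
//   rows : destination row index for each source row
//   dst  : destination buffer, row-major, n_dims values per row
// Assumes n_dims is even and every rows[r] is a valid row of dst.
void rope_norm_set_rows_ref(const std::vector<float>&   src,
                            const std::vector<int32_t>& pos,
                            const std::vector<int64_t>& rows,
                            std::size_t                 n_dims,
                            float                       freq_base,
                            std::vector<float>&         dst) {
    const std::size_t n_rows = pos.size();
    for (std::size_t r = 0; r < n_rows; ++r) {
        const float* x = src.data() + r * n_dims;
        // The "set_rows" half: pick the output row from the index list.
        float* y = dst.data() + static_cast<std::size_t>(rows[r]) * n_dims;
        // The "rope" half: rotate each adjacent pair (x[i], x[i+1]).
        for (std::size_t i = 0; i + 1 < n_dims; i += 2) {
            const float theta =
                pos[r] * std::pow(freq_base, -static_cast<float>(i) / n_dims);
            const float c = std::cos(theta);
            const float s = std::sin(theta);
            // Rotated values go directly to the destination row.
            y[i]     = x[i] * c - x[i + 1] * s;
            y[i + 1] = x[i] * s + x[i + 1] * c;
        }
    }
}
```

In an unfused version, the rotation would first fill a temporary tensor and a second pass would copy its rows to the indexed destinations. Folding the index lookup into the rotation loop removes that extra read and write.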