llama.cpp
a90eb94c - CUDA: fuse rope + set_rows (#16884)
Commit
31 days ago
CUDA: fuse rope + set_rows (#16884)

* CUDA: add fused rope
* move k forward_expand up
* create helper function instead of re-using params
* make assert statement more in line with comment
* rope_norm: coalesced writes to global mem
References
#16884 - CUDA: fuse rope + set_rows
Author
am17an
Parents
07751f8d