llama.cpp
3f460a2b
- cuda : add RoPE kernel for mode == 2 (NeoX) (#2760)
Commit
2 years ago
cuda : add RoPE kernel for mode == 2 (NeoX) (#2760)

* cuda : add RoPE kernel for mode == 2 (NeoX)
* falcon : do not offload the embeddings layer
References
#2760 - cuda : add RoPE kernel for mode == 2 (NeoX)
Author
ggerganov
Parents
87e3733f