llama.cpp
1215ed7d - CUDA: Implemented row flattening for non-glm RoPE (#2468)

Commit
2 years ago
CUDA: Implemented row flattening for non-glm RoPE (#2468)
Parents
Loading