ggml
79ea7a41 - cuda : support flattened GLM-style rope to reduce kernel launch (#477)

Commit
2 years ago
cuda : support flattened GLM-style rope to reduce kernel launch (#477)
Author
Parents
Loading