llama.cpp
8b20858e
- perplexity : faster Winogrande via batching (#5024)
Commit
1 year ago
perplexity : faster Winogrande via batching (#5024)

* perplexity : faster Winogrande via batching

ggml-ci

* perplexity : remove unused function

* perplexity : only tokenize selected tasks for Winogrande
References
#5024 - perplexity : faster Winogrande via batching
Author
ggerganov
Parents
57e2a7a5