llama.cpp
8ee2a68d
- grammar: reuse decoded tokens and pieces
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
grammar: reuse decoded tokens and pieces
References
grammar-speedup
Author
Olivier Chafik
Parents
15fe172c
Loading