llama.cpp
d3f0c716 - Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858)

Commit
1 year ago
Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858)

* gguf-py, llama : add constants and methods related to Llama-3.1 <|eom_id|> token
* llama : find Llama-3.1 <|eom_id|> token id during vocab loading
* llama-vocab : add Llama-3.1 <|eom_id|> token to the set of tokens stopping the generation

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
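The behavior described in this commit message can be illustrated with a minimal sketch. It is not the code from llama.cpp or gguf-py: the token ids, `build_stop_set`, and `generate` are hypothetical names chosen for illustration, and real ids come from the model vocabulary. It assumes only that generation ends as soon as the sampled token belongs to a set of end-of-generation ids, and that <|eom_id|> is one member of that set.

```python
from typing import Callable, Iterable, List, Optional, Set

# Hypothetical ids for illustration only; real values are read from the vocab.
EOS_ID = 1   # end of sequence
EOT_ID = 2   # end of turn
EOM_ID = 3   # end of message (<|eom_id|>), emitted when the model makes a tool call


def build_stop_set(ids: Iterable[Optional[int]]) -> Set[int]:
    """Collect the ids that end generation, skipping ones the vocab lacks (None or negative)."""
    return {i for i in ids if i is not None and i >= 0}


def generate(sample_next: Callable[[], int], stop_ids: Set[int], max_tokens: int) -> List[int]:
    """Sample tokens until a stop token appears or max_tokens tokens have been sampled."""
    out: List[int] = []
    for _ in range(max_tokens):
        tok = sample_next()
        if tok in stop_ids:
            break  # the stop token itself is not added to the output
        out.append(tok)
    return out


# Usage: a fake sampler that emits <|eom_id|> after two ordinary tokens.
stream = iter([10, 11, EOM_ID, 12])
tokens = generate(lambda: next(stream), build_stop_set([EOS_ID, EOT_ID, EOM_ID]), max_tokens=16)
```

If `EOM_ID` were left out of the stop set, the loop in this sketch would keep sampling past the end of the tool call, which is the situation the commit title describes.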