llama.cpp
d3f0c716 - Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858)

Commit

1 year ago

Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858) * gguf-py, llama : add constants and methods related to Llama-3.1 <|eom_id|> token * llama : find Llama-3.1 <|eom_id|> token id during vocab loading * llama-vocab : add Llama-3.1 <|eom_id|> token to the set of tokens stopping the generation --------- Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

References

#8858 - Stop the generation when <|eom_id|> token is encountered (needed for llama 3.1 tool call support)

Author

fairydreaming

Parents

e31a4f67

llama.cpp d3f0c716 - Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858)

llama.cpp
d3f0c716 - Stop the generation when <|eom_id|> token is encountered - needed for Llama 3.1 tool call support (#8858)