onnxruntime
f9cddd2c - Remove early stopping from LLaMA end-to-end benchmarking (#20033)

Commit
2 years ago
Remove early stopping from LLaMA end-to-end benchmarking (#20033) ### Description This PR removes early stopping from the end-to-end LLaMA-2 benchmark script. ### Motivation and Context This allows models to always generate the requested number of new tokens.
Parents
Loading