onnxruntime
f9cddd2c - Remove early stopping from LLaMA end-to-end benchmarking (#20033)

Commit

2 years ago

Remove early stopping from LLaMA end-to-end benchmarking (#20033) ### Description This PR removes early stopping from the end-to-end LLaMA-2 benchmark script. ### Motivation and Context This allows models to always generate the requested number of new tokens.

References

#20033 - Remove early stopping from LLaMA end-to-end benchmarking

Author

kunal-vaishnavi

Parents

7e84ba0e

onnxruntime f9cddd2c - Remove early stopping from LLaMA end-to-end benchmarking (#20033)

onnxruntime
f9cddd2c - Remove early stopping from LLaMA end-to-end benchmarking (#20033)