Reduce LLaMA memory usage #18181
Reduce memory usage during export and benchmarking
53a6cef6
Add changes suggested by linter
3c02b99b
Fix CodeQL error
094e58ca
Update max sequence length
d998c3cf
Add changes from PR feedback
428280e7
Update max sequence lengths
019556e1
Remove max sequence length as optional argument
f2c61d88
tianleiwu
approved these changes
on 2023-11-01
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub