onnxruntime
Reduce LLaMA memory usage
#18181
Merged

kunal-vaishnavi Reduce memory usage during export and benchmarking
53a6cef6
github-advanced-security commented on 2023-10-31
kunal-vaishnavi Add changes suggested by linter
3c02b99b
kunal-vaishnavi added release:1.16.2
kunal-vaishnavi added sdxl_llama
kunal-vaishnavi Fix CodeQL error
094e58ca
frank-dong-ms commented on 2023-10-31
frank-dong-ms commented on 2023-10-31
kunal-vaishnavi Update max sequence length
d998c3cf
kunal-vaishnavi Add changes from PR feedback
428280e7
frank-dong-ms commented on 2023-10-31
frank-dong-ms dismissed these changes on 2023-10-31
kunal-vaishnavi Update max sequence lengths
019556e1
kunal-vaishnavi dismissed their stale review via 019556e1 2 years ago
kunal-vaishnavi Remove max sequence length as optional argument
f2c61d88
tianleiwu approved these changes on 2023-11-01
kunal-vaishnavi merged d1b85f5f into main 2 years ago
tianleiwu removed release:1.16.2
tianleiwu removed sdxl_llama

