onnxruntime
Add LLaMA GQA ragged batching
#18337
Merged

Add LLaMA GQA ragged batching #18337

kunal-vaishnavi
kunal-vaishnavi Fix GQA ragged batching
7ccec207
kunal-vaishnavi Remove GQA left padding fix
645225ec
kunal-vaishnavi Add changes suggested by linter
21fd8559
kunal-vaishnavi kunal-vaishnavi added release:1.16.2
kunal-vaishnavi kunal-vaishnavi added sdxl_llama
kunal-vaishnavi Merge branch 'main' into kvaishnavi/llama-fix-gqa-batching
1eabc508
kunal-vaishnavi Update README
d1b5511b
tianleiwu tianleiwu requested a review from frank-dong-ms frank-dong-ms 2 years ago
yufenglee
yufenglee approved these changes on 2023-11-08
tianleiwu tianleiwu merged c8def0cc into main 2 years ago
tianleiwu
tianleiwu commented on 2023-11-08
tianleiwu tianleiwu removed release:1.16.2
tianleiwu tianleiwu removed sdxl_llama

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone