onnxruntime
Add LLaMA GQA ragged batching
#18337
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
5
Changes
View On
GitHub
Commits
Fix GQA ragged batching
kunal-vaishnavi
committed
2 years ago
Remove GQA left padding fix
kunal-vaishnavi
committed
2 years ago
Add changes suggested by linter
kunal-vaishnavi
committed
2 years ago
Merge branch 'main' into kvaishnavi/llama-fix-gqa-batching
kunal-vaishnavi
committed
2 years ago
Update README
kunal-vaishnavi
committed
2 years ago
Loading