onnxruntime
Update replacing MultiHeadAttention with GroupQueryAttention
#19882
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
Update replacing MultiHeadAttention with GroupQueryAttention
#19882
kunal-vaishnavi
merged 1 commit into
microsoft:main
from
kunal-vaishnavi:kvaishnavi/update-gqa
Update replacing MHA with GQA
f9492a1e
aciddelgado
approved these changes on 2024-03-13
kunal-vaishnavi
merged
4ac98d6d
into main
1 year ago
kunal-vaishnavi
added
release:1.17.3
Login to write a write a comment.
Login via GitHub
Reviewers
aciddelgado
Assignees
No one assigned
Labels
release:1.17.3
Milestone
No milestone
Login to write a write a comment.
Login via GitHub