onnxruntime
Update LLaMA attention fusions
#19200
Merged
kunal-vaishnavi merged 3 commits into microsoft:main from kunal-vaishnavi:kvaishnavi/llama-fix-attn-mask
Fix attention mask pattern matching
e44f3477
Add symbolic shape inference after optimization
f589fdfd
kunal-vaishnavi added release:1.17.0
RyanUnderhill dismissed these changes on 2024-01-19
Update prerequisites
12890e57
kunal-vaishnavi dismissed their stale review via 12890e57 2 years ago
RyanUnderhill approved these changes on 2024-01-19
kunal-vaishnavi merged a3ecb632 into main 2 years ago
snnn removed release:1.17.0
Reviewers: RyanUnderhill
Assignees: No one assigned
Labels: None yet
Milestone: No milestone