onnxruntime
Update LLaMA attention fusions
#19200
Merged

Update LLaMA attention fusions #19200

kunal-vaishnavi
kunal-vaishnavi Fix attention mask pattern matching
e44f3477
kunal-vaishnavi Add symbolic shape inference after optimization
f589fdfd
kunal-vaishnavi kunal-vaishnavi added release:1.17.0
RyanUnderhill
RyanUnderhill dismissed these changes on 2024-01-19
kunal-vaishnavi Update prerequisites
12890e57
kunal-vaishnavi kunal-vaishnavi dismissed their stale review via 12890e57 2 years ago
RyanUnderhill
RyanUnderhill approved these changes on 2024-01-19
kunal-vaishnavi kunal-vaishnavi merged a3ecb632 into main 2 years ago
snnn snnn removed release:1.17.0
snnn

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone