onnxruntime
Fuse attention node even in case of different Q,K hidden dimensions
#8106
Merged

viboga merged 25 commits into master from Vish/opt_attn_qkv_update
viboga changes to fuse attention node and create varied dimensions
75583f73
viboga added an option to optimizer to only do offline fusion
0f83a68b
viboga fixing a typo
59294d3a
viboga merge with master
e98eaf03
viboga Merge remote-tracking branch 'origin/master' into Vish/optimizer_attn…
5c009f19
viboga requested a review 4 years ago
viboga removing extra changes
79f1dea2
viboga marked this pull request as draft 4 years ago
viboga changed the title from "Vish/opt attn qkv update" to "Fuse attention node even in case of different Q,K hidden dimensions" 4 years ago
viboga added new unit test - test_attention_fusion_for_varied_qkv_dimensions()
9d77b9c8
viboga Unit test successful for q,k,v paths with varied dimensions
02a4c482
viboga adding test model for unit test case
370b6120
viboga optimizing attention tests
e4b6b23e
viboga removing debugs
5cec7c25
viboga assigned wangyems 4 years ago
viboga assigned tianleiwu 4 years ago
viboga marked this pull request as ready for review 4 years ago
wangyems commented on 2021-06-21
wangyems commented on 2021-06-21
wangyems commented on 2021-06-21
wangyems commented on 2021-06-21
viboga minor change
4e52c0db
wangyems requested a review from wangyems 4 years ago
wangyems dismissed these changes on 2021-06-21
viboga addressing comments
9f5159bb
viboga dismissed their stale review via 9f5159bb 4 years ago
tianleiwu commented on 2021-06-22
tianleiwu commented on 2021-06-22
tianleiwu commented on 2021-06-22
tianleiwu commented on 2021-06-22
tianleiwu commented on 2021-06-22
tianleiwu commented on 2021-06-22
viboga addressing comments
4de23c01
viboga changed the new option to disable_onnxruntime
2d89fe63
tianleiwu commented on 2021-06-22
viboga replacing asserts with debugs
48c9bc9a
viboga make attn fusion backward compatible for head_size, hidden_size
b7145ece
viboga preserving behavior for shape_modified_tensor
68a8dd16
viboga adding new option as the last parameter
f76c039f
viboga cleaning up
63752239
viboga line breaks and spaces
b1ad048d
viboga formatting according to python
b4c5ed3e
viboga requested a review from tianleiwu 4 years ago
viboga requested a review from wangyems 4 years ago
viboga making the changes to fuse attention node without user input
5221a743
viboga changes to fusion_attention.py updated
34deae75
viboga bringing the code up to python standard
8e924ced
tianleiwu approved these changes on 2021-06-24
viboga merged b478086b into master 4 years ago
viboga deleted the Vish/opt_attn_qkv_update branch 4 years ago