onnxruntime
Annotation based partitioning along with resource accounting
#27595
Merged

Annotation based partitioning along with resource accounting #27595

tianleiwu merged 63 commits into main from yuslepukhin/layering
yuslepukhin
yuslepukhin Introduce annotation query function
18b5d2bf
yuslepukhin Wire annotations to partitioning interface.
1eeba1a1
yuslepukhin Fix up annotations with Transpose Optimizer
debc8dd2
yuslepukhin Add ORT_EXTENDED_MINIMAL build
8f0ff86e
yuslepukhin Move rules and matcher inside the index
f7a422ed
yuslepukhin Add Update with tests
1ef4078c
yuslepukhin TODO: Consider not removing annotations
828eca39
yuslepukhin Clear annotations after partitioning
1626d5c0
yuslepukhin Merge branch 'main' into yuslepukhin/layering
f33bab1a
yuslepukhin Address accountant bug
b3ecb39f
yuslepukhin Annotate tiny_gpt2_beamsearch by layers
b5ea1c6e
yuslepukhin Refactor Graph_GetGraphView to make it a utility
9a422bae
yuslepukhin Introduce a graph utility to create an IndexedSubgraph
a1caf934
yuslepukhin Merge branch 'main' into yuslepukhin/layering
e1b1c4f8
yuslepukhin Fix lint in python script.
acec4024
yuslepukhin Merge branch 'main' into yuslepukhin/layering
31dd7a84
yuslepukhin Merge branch 'main' into yuslepukhin/layering
9fa48492
yuslepukhin yuslepukhin requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 94 days ago
yuslepukhin Merge branch 'main' into yuslepukhin/layering
50c58c90
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-09
yuslepukhin Fix build errors and address Copilt comments
e445b608
yuslepukhin Reject duplicate rules
358f7df6
yuslepukhin Move methods to .cc
653fb8b1
yuslepukhin Remove code duplication
23a8ecfa
yuslepukhin Add missing include
ef1227e5
yuslepukhin Fix matching bug
b0b23966
yuslepukhin Change index parsing
b9e13cfa
yuslepukhin Remove wrong comment
add0227e
yuslepukhin Address minimal build issues
17e35254
yuslepukhin Fix unused arg
1b1a7dbb
adrianlizarraga
adrianlizarraga commented on 2026-03-13
yuslepukhin yuslepukhin marked this pull request as ready for review 87 days ago
yuslepukhin yuslepukhin requested a review from jambayk jambayk 87 days ago
yuslepukhin Add logging
88c2c479
yuslepukhin Make sure the annotation is copied on node copy
9b0b5298
yuslepukhin Adjust error message
dab76bc0
yuslepukhin Copy Annotations when copying nodes and inlining functions
b39a487d
yuslepukhin Update LayeringIndex after function inlining
4e260bce
yuslepukhin Add intermediate buffers accounting + temp coefficient
52143507
yuslepukhin Merge branch 'main' into yuslepukhin/layering
3fb1d1e3
tianleiwu
tianleiwu commented on 2026-03-24
tianleiwu
yuslepukhin Address MakeNodeUnassigned feedback
cd73b56d
yuslepukhin Address InlineNodes feedback
dfe4d139
yuslepukhin Fix underaccounting for shared weights in fused nodes
62f3d114
yuslepukhin yuslepukhin changed the title [POC] Annotation based partitioning along with resource accounting Annotation based partitioning along with resource accounting 78 days ago
yuslepukhin yuslepukhin requested a review from tianleiwu tianleiwu 78 days ago
yuslepukhin yuslepukhin requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 78 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-25
tianleiwu
tianleiwu commented on 2026-03-25
yuslepukhin Update onnxruntime/python/tools/layering/layer_annotate.py
3cab988f
github-advanced-security
github-advanced-security commented on 2026-03-26
github-advanced-security
github-advanced-security commented on 2026-03-26
github-actions
github-actions commented on 2026-03-26
yuslepukhin Lint
e6cb75f1
yuslepukhin Flip = prefix to exact match
b2ef9a22
yuslepukhin Adjust comments for duplicate annotations
fde63000
yuslepukhin Remove bad comment
7871afa0
yuslepukhin Adjust EpWithNoLayeringRulesSeesAllUnassignedNodes
16ec921b
yuslepukhin Throw on multiple annotations
4da5c3b4
yuslepukhin Make sure annotations are propagated on function inlining
01e4506d
yuslepukhin yuslepukhin requested a review from tianleiwu tianleiwu 77 days ago
yuslepukhin yuslepukhin requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 77 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-26
yuslepukhin Update include/onnxruntime/core/session/onnxruntime_session_options_c…
3e52f145
yuslepukhin Update onnxruntime/core/framework/graph_partitioner.cc
24a46e8c
yuslepukhin Update onnxruntime/core/graph/graph_utils.h
e745e02a
yuslepukhin Fix issues in python
59b5ccd1
yuslepukhin Address undercounting problem
054f8941
yuslepukhin Add copyright header
09967c3f
yuslepukhin yuslepukhin requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 77 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-27
yuslepukhin Update onnxruntime/core/framework/graph_partitioner.cc
d23ee080
yuslepukhin Adjust doc and implementaton for fetching layering ann
9e8be7a4
yuslepukhin Make GetContainingGraph public
f636138d
yuslepukhin Adjust accounting for fused node and remove stray local var
fcda5245
yuslepukhin Address flaky test
2da1394a
yuslepukhin yuslepukhin requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 76 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-27
yuslepukhin Update onnxruntime/core/providers/cuda/cuda_execution_provider.cc
c0c5e519
yuslepukhin Update onnxruntime/core/graph/graph_utils.cc
c55bb6ec
yuslepukhin Address review issues
cdd9faa8
yuslepukhin Fix potential perf issue
67a947b4
yuslepukhin yuslepukhin requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 76 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-27
yuslepukhin Address review comments
44c69046
yuslepukhin yuslepukhin requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 76 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2026-03-27
tianleiwu
tianleiwu commented on 2026-03-27
tianleiwu
tianleiwu commented on 2026-03-27
yuslepukhin Add documentation for ann and ep propagation. Fix L1 optimizers, add …
927a0efa
tianleiwu
tianleiwu approved these changes on 2026-03-30
tianleiwu tianleiwu merged f4bdbb8d into main 73 days ago
tianleiwu tianleiwu deleted the yuslepukhin/layering branch 73 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone