phi2 conversion/optimization script #19338
init
b0331c44
update
bf9334ac
update
31304602
update
4aed9b0e
update
97ee6eed
update
990c1da5
update
15324e5f
update
013f5510
update
eed75b81
update
f95d021a
update
5e48dc6a
update
0d796fad
Update README.md
ad42a823
add examples
4a5232be
Merge branch 'wangye/phi2_doc' of github.com:microsoft/onnxruntime in…
80f83a81
Update README.md
5eb79031
Update README.md
1489dc72
Update README.md
76ba8507
Update README.md
36d26441
update
32aa2593
update
5fc6ddd0
update
298f458a
Merge branch 'main' of github.com:microsoft/onnxruntime into wangye/p…
f7a80dd0
update
c588b44f
Update README.md
63238523
gh-yewang
marked this pull request as ready for review 2 years ago
Update README.md
1249acf5
fix link
7ec7ac12
review comments
ba1b5066
lint
64dae351
Update README.md
eb7c3505
Update onnxruntime/python/tools/transformers/models/phi2/convert_to_o…
1e471039
onnxruntime/python/tools/transformers/models/phi2/inference_example.py
f74c9511
lint
ccb73ab8
black
55c55160
lint + add gqa to symbolic shape infer
0f437f9f
add output dir
70fc4cea
Merge branch 'main' of github.com:microsoft/onnxruntime into wangye/p…
bd01cf6d
cover gqa's new change/fix typo
cf76ca86
add op statistics
83d7d4b3
refactor
cf8aa62e
mention memory limit in doc
35b0dd22
Merge branch 'main' of github.com:microsoft/onnxruntime into wangye/p…
b6679c79
fix link
f8b82add
install torch cu118
d2e0a286
shape infer change
89fa252d
Update README.md
3ccb31bf
Update requirements.txt
26519777
fix docs type
c51d59d6
Merge branch 'main' of github.com:microsoft/onnxruntime into wangye/p…
7eb32ce5
tianleiwu
approved these changes
on 2024-02-03
gh-yewang
merged
aaf32fb1
into main 2 years ago
gh-yewang
deleted the wangye/phi2_doc branch 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub