onnxruntime
Reduce Python and Nuget GPU package size
#26002
Merged

chilo-ms merged 10 commits into main from chi/remove_cuda_arch
chilo-ms
chilo-ms remove the oldest CUDA arch
b17b0a2b
chilo-ms update
86f73d2e
snnn requested a review from tianleiwu 104 days ago
chilo-ms remove older arch in Linux python whl, Linux nuget and windows nuget
4ae35511
tianleiwu commented on 2025-09-10
tianleiwu commented on 2025-09-10
tianleiwu commented on 2025-09-10
chilo-ms remove old GPU arch for windows nuget
c32103d8
chilo-ms remove old GPU arch for python wheel
69ddc16c
chilo-ms
jywu-msft added release:1.23.0
chilo-ms reduce number of k support in beam search
64f7967d
chilo-ms
chilo-ms revert beam_search_topk
bc3f01c6
chilo-ms remove FPA_INTB_GEMM for Linux python wheel
9b226b16
chilo-ms revert beam_search_topk
b28e74b8
chilo-ms changed the title from "Remove old CUDA arch in CMAKE_CUDA_ARCHITECTURES to reduce package size" to "Reduce Python and Nuget GPU package size" 97 days ago
tianleiwu dismissed these changes on 2025-09-17
chilo-ms Add back support for SM75 on Linux and disable FPA_INTB_GEMM
5aeb8745
chilo-ms dismissed their stale review via 5aeb8745 96 days ago
snnn approved these changes on 2025-09-18
tianleiwu approved these changes on 2025-09-18
chilo-ms merged fd35afb9 into main 95 days ago
chilo-ms deleted the chi/remove_cuda_arch branch 95 days ago
snnn removed release:1.23.0
snnn
