auto-round
[High Risk]reduce vram usage for optimized RTN mode
#1043
Merged

[High Risk]reduce vram usage for optimized RTN mode #1043

wenhuach21 merged 67 commits into main from optimize_gguf_vram
wenhuach21
wenhuach21 reduce vram
8ceb6a1d
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
97f460e4
xin3he
xin3he commented on 2025-11-18
xin3he
xin3he commented on 2025-11-18
wenhuach21 update
dd27f91a
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
0ea4fa23
wenhuach21 update
d40de66e
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
67a1b343
wenhuach21 fix bug
2468f0a4
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
0bd2cf92
wenhuach21 update
77014877
wenhuach21 Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
b54c1e48
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
42e5cc0d
wenhuach21 git push
1307dddb
wenhuach21 wenhuach21 marked this pull request as draft 44 days ago
wenhuach21 fix accuracy bug
6d6d86ab
wenhuach21 wenhuach21 marked this pull request as ready for review 44 days ago
wenhuach21 trigger ut
e2586f95
wenhuach21 clean code
c7b3c241
wenhuach21 wenhuach21 changed the title reduce vram [WIP]reduce vram usage for RTN mode 44 days ago
wenhuach21 q80 q4k
8ad2019e
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
d3168544
wenhuach21 q5k
1743472b
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
5ffa12b6
wenhuach21 all ggufs use inplace ops
db5c6423
wenhuach21 update
ec6cb462
wenhuach21 Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
ab5067b6
wenhuach21 update
5a503b4d
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
737977a9
wenhuach21 wenhuach21 requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 43 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2025-11-19
wenhuach21 Update auto_round/export/export_to_gguf/packing.py
c3b9213b
wenhuach21 Update auto_round/export/export_to_gguf/packing.py
c2fe2672
wenhuach21 Update auto_round/export/export_to_gguf/packing.py
963e6f90
wenhuach21 Update auto_round/export/export_to_gguf/packing.py
4c6366ab
wenhuach21 Update auto_round/data_type/gguf.py
343dbb6b
wenhuach21 Update auto_round/compressors/base.py
932f407d
wenhuach21 Update auto_round/export/export_to_gguf/packing.py
8304a025
wenhuach21 fix by comments
bc86fdcf
wenhuach21 Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
08514f9d
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
033330e7
wenhuach21 wenhuach21 requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 43 days ago
wenhuach21 wenhuach21 changed the title [WIP]reduce vram usage for RTN mode reduce vram usage for optimized RTN mode 43 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2025-11-19
wenhuach21 fix line too long
c905bd27
wenhuach21 wenhuach21 changed the title reduce vram usage for optimized RTN mode [High Risk]reduce vram usage for optimized RTN mode 43 days ago
wenhuach21 wenhuach21 requested a review from n1ck-guo n1ck-guo 43 days ago
wenhuach21 wenhuach21 requested a review from WeiweiZhang1 WeiweiZhang1 43 days ago
wenhuach21 wenhuach21 requested a review from yiliu30 yiliu30 43 days ago
wenhuach21 update readme
a6165793
wenhuach21 update
cd01f132
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
876267e2
wenhuach21
wenhuach21 commented on 2025-11-19
wenhuach21 clean code
1138c737
wenhuach21 Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
111933fa
n1ck-guo
n1ck-guo commented on 2025-11-20
n1ck-guo
n1ck-guo commented on 2025-11-20
n1ck-guo
n1ck-guo commented on 2025-11-20
n1ck-guo
n1ck-guo approved these changes on 2025-11-20
wenhuach21 update
be5e13cf
wenhuach21 Merge branch 'main' into optimize_gguf_vram
d97429fd
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
190ea033
wenhuach21 fix typo
f16cde5c
wenhuach21 update
9f408d15
wenhuach21
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
2d059f3f
wenhuach21 Merge branch 'main' into optimize_gguf_vram
7f27d723
wenhuach21 try to fix ut failure
78499a76
wenhuach21 try to fix ut failure
575103fc
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
55efbbca
wenhuach21 try to fix ut failure
9fa4cd99
wenhuach21 Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
56abaa88
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
70a3fdb8
wenhuach21 try to fix ut failure
b035b4f7
wenhuach21 update
a7cd959f
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
ae99930c
wenhuach21 fix
79b24902
wenhuach21 update
29b51885
wenhuach21 Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
f840d2b5
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
5e0bb6c5
wenhuach21 fix typo
d6d29795
wenhuach21 Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
9dcea242
n1ck-guo fix bug of gguf mllm
1c8fe02d
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
0c254d70
wenhuach21 refine a little
f85fe7e1
wenhuach21 wenhuach21 enabled auto-merge (squash) 38 days ago
wenhuach21
azure-pipelines
wenhuach21
azure-pipelines
wenhuach21 refine a little
94085dc7
wenhuach21 Merge branch 'main' into optimize_gguf_vram
1c57b19c
wenhuach21 wenhuach21 merged e1b89d23 into main 38 days ago
wenhuach21 wenhuach21 deleted the optimize_gguf_vram branch 38 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone