auto-round
[High Risk]reduce vram usage for optimized RTN mode
#1043

Merged

[High Risk]reduce vram usage for optimized RTN mode #1043

wenhuach21 merged 67 commits into main from optimize_gguf_vram

reduce vram

8ceb6a1d

[pre-commit.ci] auto fixes from pre-commit.com hooks

97f460e4

xin3he commented on 2025-11-18

update

dd27f91a

[pre-commit.ci] auto fixes from pre-commit.com hooks

0ea4fa23

update

d40de66e

[pre-commit.ci] auto fixes from pre-commit.com hooks

67a1b343

fix bug

2468f0a4

[pre-commit.ci] auto fixes from pre-commit.com hooks

0bd2cf92

update

77014877

Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…

b54c1e48

[pre-commit.ci] auto fixes from pre-commit.com hooks

42e5cc0d

git push

1307dddb

wenhuach21 marked this pull request as draft 200 days ago

fix accuracy bug

6d6d86ab

wenhuach21 marked this pull request as ready for review 200 days ago

trigger ut

e2586f95

clean code

c7b3c241

wenhuach21 changed the title ~~reduce vram~~ [WIP]reduce vram usage for RTN mode 200 days ago

q80 q4k

8ad2019e

[pre-commit.ci] auto fixes from pre-commit.com hooks

d3168544

q5k

1743472b

[pre-commit.ci] auto fixes from pre-commit.com hooks

5ffa12b6

all ggufs use inplace ops

db5c6423

update

ec6cb462

Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…

ab5067b6

update

5a503b4d

[pre-commit.ci] auto fixes from pre-commit.com hooks

737977a9

wenhuach21 requested a review from

copilot-pull-request-reviewer 200 days ago

copilot-pull-request-reviewer commented on 2025-11-19

Update auto_round/export/export_to_gguf/packing.py

c3b9213b

Update auto_round/export/export_to_gguf/packing.py

c2fe2672

Update auto_round/export/export_to_gguf/packing.py

963e6f90

Update auto_round/export/export_to_gguf/packing.py

4c6366ab

Update auto_round/data_type/gguf.py

343dbb6b

Update auto_round/compressors/base.py

932f407d

Update auto_round/export/export_to_gguf/packing.py

8304a025

fix by comments

bc86fdcf

Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…

08514f9d

[pre-commit.ci] auto fixes from pre-commit.com hooks

033330e7

wenhuach21 requested a review from

copilot-pull-request-reviewer 200 days ago

wenhuach21 changed the title ~~[WIP]reduce vram usage for RTN mode~~ reduce vram usage for optimized RTN mode 200 days ago

copilot-pull-request-reviewer commented on 2025-11-19

fix line too long

c905bd27

wenhuach21 changed the title ~~reduce vram usage for optimized RTN mode~~ [High Risk]reduce vram usage for optimized RTN mode 200 days ago

wenhuach21 requested a review from

n1ck-guo 200 days ago

wenhuach21 requested a review from

WeiweiZhang1 200 days ago

wenhuach21 requested a review from

yiliu30 200 days ago

update readme

a6165793

update

cd01f132

[pre-commit.ci] auto fixes from pre-commit.com hooks

876267e2

wenhuach21 commented on 2025-11-19

clean code

1138c737

Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…

111933fa

n1ck-guo commented on 2025-11-20

n1ck-guo approved these changes on 2025-11-20

update

be5e13cf

Merge branch 'main' into optimize_gguf_vram

d97429fd

[pre-commit.ci] auto fixes from pre-commit.com hooks

190ea033

fix typo

f16cde5c

update

9f408d15

[pre-commit.ci] auto fixes from pre-commit.com hooks

2d059f3f

Merge branch 'main' into optimize_gguf_vram

7f27d723

try to fix ut failure

78499a76

try to fix ut failure

575103fc

[pre-commit.ci] auto fixes from pre-commit.com hooks

55efbbca

try to fix ut failure

9fa4cd99

Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…

56abaa88

[pre-commit.ci] auto fixes from pre-commit.com hooks

70a3fdb8

try to fix ut failure

b035b4f7

update

a7cd959f

[pre-commit.ci] auto fixes from pre-commit.com hooks

ae99930c

fix

79b24902

update

29b51885

Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…

f840d2b5

[pre-commit.ci] auto fixes from pre-commit.com hooks

5e0bb6c5

fix typo

d6d29795

Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…

9dcea242

fix bug of gguf mllm

1c8fe02d

[pre-commit.ci] auto fixes from pre-commit.com hooks

0c254d70

refine a little

f85fe7e1

wenhuach21 enabled auto-merge (squash) 195 days ago

refine a little

94085dc7

Merge branch 'main' into optimize_gguf_vram

1c57b19c

wenhuach21 merged e1b89d23 into main 195 days ago

wenhuach21 deleted the optimize_gguf_vram branch 195 days ago

Reviewers

n1ck-guo

xin3he

copilot-pull-request-reviewer

WeiweiZhang1

yiliu30

Assignees

No one assigned

Labels

None yet

Milestone

No milestone

auto-round [High Risk]reduce vram usage for optimized RTN mode #1043 Merged

[High Risk]reduce vram usage for optimized RTN mode #1043

auto-round
[High Risk]reduce vram usage for optimized RTN mode
#1043

Merged