[High Risk]reduce vram usage for optimized RTN mode #1043
reduce vram
8ceb6a1d
[pre-commit.ci] auto fixes from pre-commit.com hooks
97f460e4
xin3he
commented
on 2025-11-18
xin3he
commented
on 2025-11-18
update
dd27f91a
[pre-commit.ci] auto fixes from pre-commit.com hooks
0ea4fa23
update
d40de66e
[pre-commit.ci] auto fixes from pre-commit.com hooks
67a1b343
fix bug
2468f0a4
[pre-commit.ci] auto fixes from pre-commit.com hooks
0bd2cf92
update
77014877
Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
b54c1e48
[pre-commit.ci] auto fixes from pre-commit.com hooks
42e5cc0d
git push
1307dddb
wenhuach21
marked this pull request as draft 44 days ago
fix accuracy bug
6d6d86ab
wenhuach21
marked this pull request as ready for review 44 days ago
trigger ut
e2586f95
clean code
c7b3c241
wenhuach21
changed the title reduce vram [WIP]reduce vram usage for RTN mode 44 days ago
q80 q4k
8ad2019e
[pre-commit.ci] auto fixes from pre-commit.com hooks
d3168544
q5k
1743472b
[pre-commit.ci] auto fixes from pre-commit.com hooks
5ffa12b6
all ggufs use inplace ops
db5c6423
update
ec6cb462
Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
ab5067b6
update
5a503b4d
[pre-commit.ci] auto fixes from pre-commit.com hooks
737977a9
Update auto_round/export/export_to_gguf/packing.py
c3b9213b
Update auto_round/export/export_to_gguf/packing.py
c2fe2672
Update auto_round/export/export_to_gguf/packing.py
963e6f90
Update auto_round/export/export_to_gguf/packing.py
4c6366ab
Update auto_round/data_type/gguf.py
343dbb6b
Update auto_round/compressors/base.py
932f407d
Update auto_round/export/export_to_gguf/packing.py
8304a025
fix by comments
bc86fdcf
Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
08514f9d
[pre-commit.ci] auto fixes from pre-commit.com hooks
033330e7
wenhuach21
changed the title [WIP]reduce vram usage for RTN mode reduce vram usage for optimized RTN mode 43 days ago
fix line too long
c905bd27
wenhuach21
changed the title reduce vram usage for optimized RTN mode [High Risk]reduce vram usage for optimized RTN mode 43 days ago
update readme
a6165793
update
cd01f132
[pre-commit.ci] auto fixes from pre-commit.com hooks
876267e2
clean code
1138c737
Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
111933fa
n1ck-guo
approved these changes
on 2025-11-20
update
be5e13cf
Merge branch 'main' into optimize_gguf_vram
d97429fd
[pre-commit.ci] auto fixes from pre-commit.com hooks
190ea033
fix typo
f16cde5c
update
9f408d15
[pre-commit.ci] auto fixes from pre-commit.com hooks
2d059f3f
Merge branch 'main' into optimize_gguf_vram
7f27d723
try to fix ut failure
78499a76
try to fix ut failure
575103fc
[pre-commit.ci] auto fixes from pre-commit.com hooks
55efbbca
try to fix ut failure
9fa4cd99
Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
56abaa88
[pre-commit.ci] auto fixes from pre-commit.com hooks
70a3fdb8
try to fix ut failure
b035b4f7
update
a7cd959f
[pre-commit.ci] auto fixes from pre-commit.com hooks
ae99930c
fix
79b24902
update
29b51885
Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
f840d2b5
[pre-commit.ci] auto fixes from pre-commit.com hooks
5e0bb6c5
fix typo
d6d29795
Merge branch 'optimize_gguf_vram' of https://github.com/intel/auto-ro…
9dcea242
fix bug of gguf mllm
1c8fe02d
[pre-commit.ci] auto fixes from pre-commit.com hooks
0c254d70
refine a little
f85fe7e1
wenhuach21
enabled auto-merge (squash) 38 days ago
refine a little
94085dc7
Merge branch 'main' into optimize_gguf_vram
1c57b19c
wenhuach21
deleted the optimize_gguf_vram branch 38 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub