Support for immediate saving to reduce ram usage #965
save per block
9da69a88
enable multi block save
dc24682b
support export save
856ab066
update rtn support
97cbfdbd
support
fb026aa6
rebase main
bb6113f9
del utils.py
148f085d
[pre-commit.ci] auto fixes from pre-commit.com hooks
afb6eefb
fix args
b2c14dfc
xin3he
added this to the 1.0 milestone 229 days ago
xin3he
removed this from to the 1.0 milestone 229 days ago
xin3he
added this to the 0.9.0 milestone 229 days ago
optimize memory and fix rtn module
38297788
revert max_shard_size
8f388395
move save_block_immediate into utils
70d873e2
[pre-commit.ci] auto fixes from pre-commit.com hooks
f4ace2cd
[pre-commit.ci] auto fixes from pre-commit.com hooks
c3142310
fix import
f64cfcf0
update max_shard_size
e471148e
Merge branch 'kaihui/save_block' of https://github.com/intel/auto-rou…
b19db96f
wenhuach21
changed the title Support for immediate saving [High Risk]Support for immediate saving 229 days ago
remove is_meta_model
02f891d6
[pre-commit.ci] auto fixes from pre-commit.com hooks
f6671ed3
merge main
c1da5e56
add uts
39b54705
[pre-commit.ci] auto fixes from pre-commit.com hooks
579a7f80
fix gpu ut model_name
f1793e05
set immediate packing saving to True
5aa3f1ef
[pre-commit.ci] auto fixes from pre-commit.com hooks
67e2591f
Merge branch 'main' into kaihui/save_block
ca55a906
flow saving setting
7a21c5aa
check gguf
e5fd4f24
pack layer_names immediately
5367af64
wrapper _immediate_pack & rm uts & pop lm_head & rm expose args
5065336b
revert ut
ceb2a92d
[pre-commit.ci] auto fixes from pre-commit.com hooks
5375be6d
add low_cpu_mem_usage
70f59ec3
merge main
bd1532b4
[pre-commit.ci] auto fixes from pre-commit.com hooks
a551e131
fix conflict
50a82e2f
update low_cpu_mem_usage
3c682c65
move to last arg
35750399
wenhuach21
changed the title [High Risk]Support for immediate saving Support for immediate saving to reduce ram usage 222 days ago
wenhuach21
deleted the kaihui/save_block branch 222 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub