auto-round
reduce ram&vram usage for vlm calib stage
#1488
Merged


WeiweiZhang1
WeiweiZhang1 reduce ram usage for vlm calib stage
b1c79a76
WeiweiZhang1 requested a review from copilot-pull-request-reviewer 84 days ago
copilot-pull-request-reviewer commented on 2026-03-03
WeiweiZhang1 Merge branch 'main' into reduce_vram/ram_usage_for_vlm_in_calib_stage
31aa2f59
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
0c225237
wenhuach21 commented on 2026-03-03
WeiweiZhang1 added the WIP label
n1ck-guo gguf better support for transformers5.0 and fix bug of Qwen3Next (#1474)
3a490376
WeiweiZhang1 Merge branch 'main' into reduce_vram/ram_usage_for_vlm_in_calib_stage
e3f1fdbd
WeiweiZhang1 fix meta issue for patch model
736e7f38
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
54e1a7c4
WeiweiZhang1 fix meta issue of patching model like qwen3.5-35B-A3B
576a417f
WeiweiZhang1 fix scan issue
cf668ce6
WeiweiZhang1 refine early stop calib logic, add ut for mllm calib cache check
d7c67d7c
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
fa4f342a
WeiweiZhang1 Merge branch 'main' into reduce_vram/ram_usage_for_vlm_in_calib_stage
c7f81d01
WeiweiZhang1 removed the WIP label
WeiweiZhang1 requested a review from yiliu30 81 days ago
WeiweiZhang1 requested a review from n1ck-guo 81 days ago
yiliu30 approved these changes on 2026-03-06
WeiweiZhang1 Merge branch 'main' into reduce_vram/ram_usage_for_vlm_in_calib_stage
0e8b48e2
WeiweiZhang1 merged be387131 into main 78 days ago
WeiweiZhang1 deleted the reduce_vram/ram_usage_for_vlm_in_calib_stage branch 78 days ago
wenhuach21 commented on 2026-03-09

Assignees
No one assigned