auto-round
[Step1 ]new architecture for auto_round
#1542
Merged

chensuyue merged 123 commits into main from hengguo/new_ar_arch
n1ck-guo
n1ck-guo init
7698b934
n1ck-guo requested a review from lkk12014402 98 days ago
n1ck-guo requested a review from xin3he 98 days ago
n1ck-guo requested a review from lvliang-intel 98 days ago
n1ck-guo requested a review from wenhuach21 98 days ago
n1ck-guo requested a review from copilot-pull-request-reviewer 98 days ago
n1ck-guo added draft
n1ck-guo added engineering
copilot-pull-request-reviewer commented on 2026-03-13
wenhuach21
wenhuach21 commented on 2026-03-13
n1ck-guo removed review request from xin3he 97 days ago
n1ck-guo requested a review from WeiweiZhang1 97 days ago
n1ck-guo requested a review from yiliu30 97 days ago
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
75b4141e
n1ck-guo update
ca170974
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
a092e379
lvliang-intel commented on 2026-03-16
lvliang-intel commented on 2026-03-16
lvliang-intel commented on 2026-03-16
lvliang-intel commented on 2026-03-16
lvliang-intel commented on 2026-03-16
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
cec4ce4b
chensuyue added this to the 0.12.0 milestone 94 days ago
n1ck-guo update
e265b8fe
n1ck-guo merge main
868a82db
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
9dc930c6
n1ck-guo add switch
70a2d026
n1ck-guo changed the title from "[WIP] new architecture for auto_round" to "new architecture for auto_round" 93 days ago
wenhuach21 commented on 2026-03-17
n1ck-guo removed draft
n1ck-guo added api/new
n1ck-guo code scan
5998d444
n1ck-guo requested a review from thuang6 93 days ago
n1ck-guo requested a review from xin3he 93 days ago
n1ck-guo requested a review from lvliang-intel 93 days ago
n1ck-guo Merge branch 'hengguo/new_ar_arch' of https://github.com/intel/auto-r…
94125969
n1ck-guo requested a review from wenhuach21 93 days ago
wenhuach21
yiliu30 commented on 2026-03-17
n1ck-guo fix
394dcdd5
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
7024cad0
yiliu30 commented on 2026-03-18
n1ck-guo fix
36daba08
n1ck-guo fix
6feed993
n1ck-guo fix qweight
7bd3e62b
n1ck-guo fix ut and refactor code
9b149183
n1ck-guo fix ut
2ab9b51b
n1ck-guo fix
dd5aec7e
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
d65f1ebf
n1ck-guo fix merge
bde95c68
n1ck-guo fix
7b4e479a
n1ck-guo update
9b4cab71
n1ck-guo merge main
b602e008
n1ck-guo sync merge change
a1fe717c
n1ck-guo fix
b58d55aa
wenhuach21 commented on 2026-03-24
wenhuach21 commented on 2026-03-24
wenhuach21
wenhuach21 commented on 2026-03-25
wenhuach21 commented on 2026-03-25
wenhuach21 commented on 2026-03-25
wenhuach21 commented on 2026-03-25
wenhuach21 commented on 2026-03-25
wenhuach21 commented on 2026-03-25
wenhuach21 commented on 2026-03-25
n1ck-guo fix ut
6a7ac607
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
64d4a570
n1ck-guo decoupling quantization and refactor hadamard
b753bab1
n1ck-guo support multi rotation
b32bc685
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
dbd1ab0e
n1ck-guo sync compressors_new: add is_dynamic_afp8, is_block_wfp8, _get_safete…
f4da8be2
n1ck-guo merge main
75a472a2
n1ck-guo requested a review from yiliu30 79 days ago
n1ck-guo fix
01f68718
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
53bef7c5
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
20ade76c
n1ck-guo fix
41e75bd2
n1ck-guo fix
92139d6c
wenhuach21 commented on 2026-03-31
wenhuach21 commented on 2026-03-31
wenhuach21 commented on 2026-03-31
wenhuach21 commented on 2026-03-31
wenhuach21 commented on 2026-03-31
wenhuach21 commented on 2026-03-31
n1ck-guo fix output dir
166b5b65
lkk12014402 commented on 2026-03-31
wenhuach21 commented on 2026-03-31
lkk12014402
wenhuach21
wenhuach21
n1ck-guo
n1ck-guo
wenhuach21
n1ck-guo update by comment
31b2d2b9
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
4490a17a
wenhuach21 commented on 2026-04-01
wenhuach21 commented on 2026-04-01
wenhuach21 requested changes on 2026-04-01
wenhuach21 requested a review from wenhuach21 78 days ago
wenhuach21 commented on 2026-04-01
wenhuach21 commented on 2026-04-01
n1ck-guo update
fdc92c2f
n1ck-guo fix
fb046131
n1ck-guo fix by comment
45882798
n1ck-guo fix output_dir
a313c264
n1ck-guo fix
19f95eda
n1ck-guo fix
29d2b64e
n1ck-guo merge
bfec8423
lvliang-intel commented on 2026-04-03
n1ck-guo fix
1c9e5296
n1ck-guo fix vlm ut
7e7fdeb5
lvliang-intel commented on 2026-04-03
lvliang-intel commented on 2026-04-03
lvliang-intel commented on 2026-04-03
lvliang-intel commented on 2026-04-03
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
4a035fbb
n1ck-guo fix ut
463bb6c2
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
f5d6ff4f
n1ck-guo sync merge
755ab4e1
n1ck-guo fix by comment
d661e0b2
n1ck-guo merge
7a80debd
n1ck-guo fix
08770cf5
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
709269af
n1ck-guo fix
97b89dda
n1ck-guo performance
00252569
n1ck-guo added ready
n1ck-guo requested a review from lkk12014402 71 days ago
n1ck-guo requested a review from lvliang-intel 71 days ago
n1ck-guo fix
18311269
n1ck-guo fix
8873eca8
n1ck-guo fix
a1a42447
n1ck-guo preformance
bd755361
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
f2940bd2
n1ck-guo requested a review from wenhuach21 69 days ago
n1ck-guo sync
e4ce4206
yiliu30 commented on 2026-04-10
n1ck-guo fix
1286749c
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
99143064
n1ck-guo
azure-pipelines
n1ck-guo performance
5c212b56
wenhuach21 commented on 2026-04-10
wenhuach21 commented on 2026-04-10
wenhuach21 commented on 2026-04-10
n1ck-guo
n1ck-guo
azure-pipelines
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
550158be
n1ck-guo performance
4806d5aa
n1ck-guo fix
1f1fbd93
n1ck-guo update
e4fdfe6d
n1ck-guo fix: skip compile_func for FP8_STATIC on HPU + trim malloc at ModelCo…
ec45a1c5
n1ck-guo fix(memory): reduce peak RSS for new arch via forced malloc_trim and …
3cd3c739
xin3he commented on 2026-04-13
n1ck-guo merge main
5d4a85db
n1ck-guo fix(memory): reduce peak RAM via deferred ShardWriter, intermediate G…
14a59db1
n1ck-guo
azure-pipelines
wenhuach21
lkk12014402 approved these changes on 2026-04-14
n1ck-guo fix
29969c88
azure-pipelines
n1ck-guo merge main
654c733b
n1ck-guo
azure-pipelines
n1ck-guo update
c7f21a75
azure-pipelines
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
72c04f9d
n1ck-guo sync hadamard transform changes from main branch to new architecture
028bb069
azure-pipelines
n1ck-guo
azure-pipelines
n1ck-guo fix sglang test: switch OPT to Qwen3-0.6B to avoid fused qkv_proj reg…
451008b5
n1ck-guo fix: invalidate compiled block forward cache on block change; guard l…
8bdf054e
n1ck-guo Merge origin/main: resolve conflicts in test files
8f580393
n1ck-guo sync 8d7bb84c to new arch: enable immediate_saving for nv_fp/mx_fp, f…
d067b534
n1ck-guo merge main
8d2f3419
n1ck-guo fix ut
cec09580
n1ck-guo
azure-pipelines
wenhuach21 requested a review from wenhuach21 62 days ago
wenhuach21 approved these changes on 2026-04-17
wenhuach21 changed the title from "new architecture for auto_round" to "[Step1 ]new architecture for auto_round" 62 days ago
n1ck-guo fix HPU FP8_STATIC peak RAM: disable eager pipeline in new-arch, clea…
459435ca
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
463f58d2
n1ck-guo test: add unit tests for HPU FP8_STATIC eager pipeline guard
6cbb1560
n1ck-guo fix ut
9f88982b
n1ck-guo Merge branch 'hengguo/new_ar_arch' of https://github.com/intel/auto-r…
50beadcb
n1ck-guo Merge branch 'main' into hengguo/new_ar_arch
e21c349b
yiliu30 approved these changes on 2026-04-20
n1ck-guo merge main
18ba254d
n1ck-guo fix merge
e6de66bb
n1ck-guo fix
85740b5c
n1ck-guo clean
66c4da12
n1ck-guo fix W8A16 VRAM regression: skip block_forward compile for zero-shot path
ab9d972d
n1ck-guo Merge main: resolve conflicts, sync calib.py shared_cache_keys fix
53bdb74c
n1ck-guo sync rotation/hadamard: handle rotation_config kwarg in AutoRoundComp…
70ed236e
n1ck-guo sync: rename HadamardConfig to RotationConfig in new arch transforms
c07ca3f1
n1ck-guo refactor: rename algorithms/transforms/hadamard -> rotation
4f534100
n1ck-guo refactor: dedupe experimental/transform, re-export from new arch rota…
00fc8006
n1ck-guo refactor(rotation): physically migrate inplace+dispatcher to new arch
8c96d9b3
n1ck-guo refactor(rotation): convert experimental/transform/apply.py to shim
01f02cf2
n1ck-guo refactor(entry): simplify rotation_config kwarg handling in AutoRound…
08caa0d4
n1ck-guo feat(rotation): support backend='inplace' in new-arch alg_configs pip…
70631e99
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
d5c8e9ba
n1ck-guo merge and sync main
aa4c5407
n1ck-guo clean
4c78670d
n1ck-guo clean
e4e025eb
n1ck-guo
azure-pipelines
n1ck-guo fix(transforms): handle block_size=None in HadamardTransform
e3087956
n1ck-guo fix cuda ut
110e9ead
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/new_ar_arch
baf6bd33
n1ck-guo merge and sync main
c97428c0
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
bad859a5
n1ck-guo sync: add xpu sdpa patch and AutoScheme VLM support to new arch
441d060f
n1ck-guo
azure-pipelines
n1ck-guo Merge branch 'main' of https://github.com/intel/auto-round into hengg…
dd7b7f0f
n1ck-guo merge main
efe2c74b
n1ck-guo
azure-pipelines
n1ck-guo fix diffusion ut
26aa1066
n1ck-guo diffusion: align new arch with old DiffusionCompressor
9d98b0da
n1ck-guo
azure-pipelines
n1ck-guo fix
2fe5f032
n1ck-guo
azure-pipelines
n1ck-guo
azure-pipelines
chensuyue merged 91585c7d into main 51 days ago
chensuyue deleted the hengguo/new_ar_arch branch 51 days ago
