auto-round
refactor and support for multi algs fusion
#1852
Closed

n1ck-guo wants to merge 72 commits into main from hengguo/refactor_algs
n1ck-guo refactor and support for multi algs fusion
d8fc7cc4
n1ck-guo requested a review from copilot-pull-request-reviewer 22 days ago
n1ck-guo Merge branch 'main' into hengguo/refactor_algs
18e8b15d
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
4de34b8d
copilot-pull-request-reviewer commented on 2026-05-26
wenhuach21
n1ck-guo fix bugs
c5c08df3
n1ck-guo fix and handle shared config
be623ea1
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
006374eb
xin3he dynamic map checkpoint naming based on model objective. (#1840)
10d8826a
wenhuach21 refine/fix gptq format (#1853)
8bce7b04
WeiweiZhang1 fix save_quantized log conflict (#1845)
f2a9b1db
n1ck-guo merge main
e70aeeec
n1ck-guo Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto…
76bef3af
wenhuach21 update doc (#1856)
b5d5c34b
lvliang-intel Fix Qwen Omni quantization model issue for long form audio generation…
f7a13934
n1ck-guo fix bug of qwen and gguf export (#1846)
502b800e
lkk12014402 refactor quarot/spinquant rotation, simplifying code. (#1849)
717d252e
chensuyue Enhance CI workflow (#1858)
243c68a9
lvliang-intel Fix gemma4 crash issue during quantizing (#1860)
13b15498
n1ck-guo relocate awq
d72fd9bc
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
4e7949cf
yiliu30 fix: preserve FP8 llmc metadata for mxfp8 (#1861)
1b8767f1
luoyu-intel Fix SDPA bug (#1862)
378831f0
lvliang-intel Fix special-model predefined ignore layer filtering (#1863)
75147630
xin3he Fix packing format in quantization config and update variable assignm…
3997a8c4
lvliang-intel commented on 2026-05-28
lvliang-intel commented on 2026-05-28
lvliang-intel commented on 2026-05-28
lvliang-intel commented on 2026-05-28
n1ck-guo refactor scheme and entry
70611ec9
n1ck-guo modify by comments
a2ea6119
n1ck-guo merge main
25956b20
wenhuach21 Fix unsupported dtype (#1868)
473d9035
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
b3294eda
n1ck-guo fix ut
93caa5cd
n1ck-guo requested a review from WeiweiZhang1 19 days ago
n1ck-guo fix
77a26e6f
XuehaoSun Fix HPU CI (#1870)
443f288c
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
2b26e4f2
n1ck-guo add llmc api
f753c062
chensuyue
azure-pipelines
xin3he fix segmentation failure (#1871)
814c6683
chensuyue Bump build version to v0.14.0 (#1875)
dc269095
WeiweiZhang1 Fix SGL CI dependency conflict (#1877)
e6b631b6
n1ck-guo gguf format add support for mtp quantization (#1866)
db6b76dd
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
b81643a1
n1ck-guo fix gguf error
ee10d643
luoyu-intel [ARK] Add bf16 + NHD layout support, refactor sage_dynamic_quant (#1882)
c74773ea
lvliang-intel Suppress misleading warning when detecting model type for GGUF export…
050c501b
n1ck-guo fix ut
d2ebbd06
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
4fc9724a
wenhuach21 fix performance regression (#1886)
9f254fbe
n1ck-guo fix
5a968849
n1ck-guo Merge branch 'main' into hengguo/refactor_algs
2638c1c7
chensuyue
azure-pipelines
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
42d77710
n1ck-guo Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto…
fde14a58
chensuyue
azure-pipelines
n1ck-guo clean and update
9edf0d03
chensuyue
azure-pipelines
n1ck-guo refactor pipeline
7fb0839e
wenhuach21 commented on 2026-06-08
n1ck-guo change num_samples to property
30f1dd05
lkk12014402 fix random rotation and update rotation doc. (#1884)
6afd14c2
chensuyue Update auto-round-lib release package build (#1895)
0de1eb05
wenhuach21 [step 1]refine code to support all devices in torch and hot fix for g…
2794a6e2
chensuyue Fix CI coverage & bug grep issue (#1893)
5ed21d3e
xin3he feat: add MXFP4/MXFP8 quantization support (llmc_compressor format) a…
6cdb2a20
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
2a12b197
n1ck-guo fix
eac3ab8d
chensuyue
azure-pipelines
XuehaoSun Fix slow startup time of pytest coverage for unit tests (#1899)
2890673c
yogesh-tessl feat: improve review-pr skill score from 76% to 90% (#1901)
205e5f60
wenhuach21
XuehaoSun update llama-cpp-python installation for CUDA CI (#1907)
fb9a772f
wenhuach21 fix gguf opt-rtn regression (#1905)
bfa795ca
Entrpi fix: guard zero-division in GGUF quant kernels to avoid NaN block sca…
d6153cb7
yiliu30 fallback compute type on b70 if needed (#1904)
83cbe978
luoyu-intel [ARK] update README (#1906)
0068522d
wenhuach21 fix inplace rotation issue (#1903)
d13b1ddd
WeiweiZhang1 fix awq cuda CI (#1912)
be67ef3a
n1ck-guo Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs
8c731933
n1ck-guo add type annotation
241cb393
n1ck-guo added enhancement
n1ck-guo added api/change
n1ck-guo added ready
n1ck-guo requested a review from lvliang-intel 5 days ago
n1ck-guo requested a review from copilot-pull-request-reviewer 5 days ago
n1ck-guo requested a review from wenhuach21 5 days ago
copilot-pull-request-reviewer commented on 2026-06-12
n1ck-guo performance
12814d5f
n1ck-guo performance
3fecf0bc
n1ck-guo Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto…
6f219c60
n1ck-guo force pushed from 181aab8f to 6f219c60 4 days ago
n1ck-guo closed this 4 days ago

Assignees
No one assigned