intel/auto-round

Pull Requests Commits

Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto-round into hengguo/refactor_algs

n1ck-guo committed 9 days ago

6f219c60

n1ck-guo committed 9 days ago

3fecf0bc

n1ck-guo committed 9 days ago

12814d5f

add type annotation

n1ck-guo committed 10 days ago

241cb393

Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs

n1ck-guo committed 10 days ago

8c731933

fix awq cuda CI (#1912)

WeiweiZhang1 committed 10 days ago

be67ef3a

fix inplace rotation issue (#1903)

wenhuach21 committed 11 days ago

d13b1ddd

[ARK] update README (#1906)

luoyu-intel committed 11 days ago

0068522d

fallback compute type on b70 if needed (#1904)

yiliu30 committed 11 days ago

83cbe978

fix: guard zero-division in GGUF quant kernels to avoid NaN block scales (#1909)

Entrpi committed 11 days ago

d6153cb7

fix gguf opt-rtn regression (#1905)

wenhuach21 committed 11 days ago

bfa795ca

update llama-cpp-python installation for CUDA CI (#1907)

XuehaoSun committed 11 days ago

fb9a772f

feat: improve review-pr skill score from 76% to 90% (#1901)

yogesh-tessl committed 12 days ago

205e5f60

Fix slow startup time of pytest coverage for unit tests (#1899)

XuehaoSun committed 12 days ago

2890673c

n1ck-guo committed 12 days ago

eac3ab8d

Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs

n1ck-guo committed 12 days ago

2a12b197

feat: add MXFP4/MXFP8 quantization support (llmc_compressor format) and related tests (#1865)

xin3he committed 12 days ago

6cdb2a20

Fix CI coverage & bug grep issue (#1893)

chensuyue committed 12 days ago

5ed21d3e

[step 1]refine code to support all devices in torch and hot fix for gemma4-unified (#1879)

wenhuach21 committed 13 days ago

2794a6e2

Update auto-round-lib release package build (#1895)

chensuyue committed 13 days ago

0de1eb05

fix random rotation and update rotation doc. (#1884)

lkk12014402 committed 13 days ago

6afd14c2

change num_samples to property

n1ck-guo committed 13 days ago

30f1dd05

refactor pipeline

n1ck-guo committed 13 days ago

7fb0839e

clean and update

n1ck-guo committed 14 days ago

9edf0d03

Merge branch 'hengguo/refactor_algs' of https://github.com/intel/auto-round into hengguo/refactor_algs

n1ck-guo committed 17 days ago

fde14a58

Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs

n1ck-guo committed 17 days ago

42d77710

Merge branch 'main' into hengguo/refactor_algs

n1ck-guo committed 17 days ago

2638c1c7

n1ck-guo committed 18 days ago

5a968849

fix performance regression (#1886)

wenhuach21 committed 18 days ago

9f254fbe

Merge remote-tracking branch 'origin/main' into hengguo/refactor_algs

n1ck-guo committed 18 days ago

4fc9724a

Older