intel/auto-round

Pull Requests Commits

Add unit test (#173)

XuehaoSun committed 1 year ago

Verified 81624095

add initial support for activation quantization (#176)

wenhuach21 committed 1 year ago

Verified 5f67048c

speedup the tuning a little (#175)

wenhuach21 committed 1 year ago

Verified 473f474d

add chat template in calib tokenization (#171)

yintong-lu committed 1 year ago

Verified 735dfc9e

[Large impact]set the default nsamples to 128 and low_gpu_mem_usage to False (#174)

wenhuach21 committed 1 year ago

Verified ab614824

support marlin in auto_round format (#172)

wenhuach21 committed 1 year ago

Verified 2b1448d4

revert the gptq format code to fix the regression (#168)

wenhuach21 committed 1 year ago

Verified 5947e9c0

fix typos, update overview img (#166)

WeiweiZhang1 committed 1 year ago

Verified 8d5765ac

1 fix a bug in autoround format with the latest transformers 2 rename n_samples n_blocks to nsamples nblocks (#163)

wenhuach21 committed 1 year ago

Verified f9e7d79e

WeiweiZhang1 committed 1 year ago

Verified 31c566cc

fix bug and limit numpy version (#159)

yintong-lu committed 1 year ago

Verified 77320b0a

support calibration dataset concat (#147)

yintong-lu committed 1 year ago

Verified 75e3fde0

remove gpt ppl eval from lm-0.4.2 (#158)

wenhuach21 committed 1 year ago

Verified 77d6a886

fix bug at whole block is excluded from quantization (#156)

wenhuach21 committed 1 year ago

Verified edcec56e

auto round quantizer supports gptq kernel (#155)

wenhuach21 committed 1 year ago

Verified 9cae103d

fix qbits issue (#153)

wenhuach21 committed 1 year ago

Verified c313fa33

Qbits related log (#151)

zhewang1-intc committed 1 year ago

Verified 34274fb3

autoround_support_qbits_backend (#145)

zhewang1-intc committed 1 year ago

Verified dbdc4a39

fix incorrect setting for lm-head (#149)

wenhuach21 committed 1 year ago

Verified 9da2beed

fix triton issue (#148)

wenhuach21 committed 1 year ago

Verified 04ea8694

refine the code (#143)

wenhuach21 committed 1 year ago

Verified 59c64022

Fix exlllamav2 backend issue (#144)

wenhuach21 committed 1 year ago

Verified e614d138

Fix asym kernel issue by following autogptq's pr (#137)

wenhuach21 committed 1 year ago

Verified 794cd903

fix typos (#140)

WeiweiZhang1 committed 2 years ago

Verified 4d2d2591

bump version into v0.2 (#139)

chensuyue committed 2 years ago

Verified aafb82ef

handling transformers version compatibility in lmhead export, bugfix (#130)

WeiweiZhang1 committed 2 years ago

Verified 4db22e1d

fix export issue with torch 2.0 (#129)

wenhuach21 committed 2 years ago

Verified 5bff86ee

Update falcon recipe (#128)

wenhuach21 committed 2 years ago

Verified 416ec7e9

fix falcon quant issue with disable_trust_remote_code (#126)

WeiweiZhang1 committed 2 years ago

Verified e2985fdf

Update phi2 recipe (#124)

wenhuach21 committed 2 years ago

Verified edca2980

Newer Older