intel/auto-round

Pull Requests Commits

fix lm-head gradient accumulation bug (#113)

wenhuach21 committed 1 year ago

Verified ecca5349

update shells (#112)

WeiweiZhang1 committed 1 year ago

Verified 3c214db0

Adjust the default evaluation data type by selecting from the model path configuration (#107)

WeiweiZhang1 committed 1 year ago

Verified c42eaa92

20% speedup by removing new zero tensor (#110)

wenhuach21 committed 1 year ago

Verified c7434b63

1.8X speedup by disable_low_gpu_mem_usage and reduce memory usage by avoid using torch.cat (#106)

wenhuach21 committed 1 year ago

Verified 23b60c3a

remove costly operations

wenhuach21 committed 1 year ago

Verified ad3a7bba

Consolidate dataloader&dataset_split to dataset (#105)

wenhuach21 committed 1 year ago

Verified 1f9cb4f8

disable quantizing lm-head with tied weights as a workaround (#102)

wenhuach21 committed 1 year ago

Verified 1fe6aae5

disable quantizing lm-head with tied weights as a workaround (#101)

wenhuach21 committed 1 year ago

Verified c839e825

update readme of calibration dataset and lm-head usage (#98)

wenhuach21 committed 1 year ago

Verified f95b8c7d

fix critic bug for gradient_accumulate_steps!=1 and reduce cpu memory of lm-head tuning (#97)

WeiweiZhang1 committed 1 year ago

Verified 7dd02eb4

handle invalid layername in weight_config (#93)

WeiweiZhang1 committed 1 year ago

Verified 89562261

yintong-lu committed 1 year ago

Verified 9b08de48

deprecate use_quant_inp arg (#90)

yintong-lu committed 1 year ago

Verified 1bf1b486

Add acc data (#89)

pursure-D committed 1 year ago

Verified 8a3da144

fix old eval bug (#86)

WeiweiZhang1 committed 1 year ago

Verified 23d35e32

Update lm-head quantization readme

wenhuach21 committed 1 year ago

Verified 511a5385

add Yi-6b-chat results (#85)

yintong-lu committed 1 year ago

Verified d38a4a56

fix old eval tasks order (#78)

WeiweiZhang1 committed 1 year ago

Verified 257c5be2

Update llama3 acc (#84)

wenhuach21 committed 1 year ago

Verified 849cf9fe

support lm head quantizaiton and export to Intel cpu (#76)

wenhuach21 committed 1 year ago

Verified 16f9b7bd

fix bloom issue

pursure-D committed 1 year ago

Verified bd2fcc9f

update W2g32 accuracy (#74)

pursure-D committed 1 year ago

Verified 16d830b4

Add baichuan-7b chat recipe (#73)

wenhuach21 committed 1 year ago

Verified b68eaf72

fix eval bug of autogptq model (#72)

yintong-lu committed 1 year ago

Verified b22f6aad

fix filter func issue (#71)

wenhuach21 committed 1 year ago

Verified 900154bf

support combination of calibration datasets (#70)

wenhuach21 committed 1 year ago

Verified e8fb5da1

[pre-commit.ci] pre-commit autoupdate (#69)

pre-commit-ci[bot] committed 1 year ago

Verified 9f7e8b81

fix Baichuan2-13B issue (#37)

WeiweiZhang1 committed 1 year ago

Verified fd3298b9

wenhuach21 committed 1 year ago

Verified 7ff627b8

Newer Older