auto-round
support lm head quantizaiton and export to Intel cpu
#76
Merged

support lm head quantizaiton and export to Intel cpu #76

wenhuach21 merged 57 commits into main from lm-head
wenhuach21
wenhuach21 fix a bug in example
7d020db2
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
fbe69d5c
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
596a18f5
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
10add8ce
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
003b60a6
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
9d495140
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
3b7f386c
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
d3f14df6
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
76e4d909
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
08e46acb
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
15b756b0
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
40425bbd
wenhuach21 step 1 for setting up lm-head quantization
9484ffa9
wenhuach21 support basic quantization tuning of layers outside blocks
e3703d64
wenhuach21 support use_quant_input tuning of layers outside blocks
d6d40d2e
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
e38c3fa6
wenhuach21 fix some issues
f1a5cf8e
wenhuach21 fix some issues
9d8ee5ff
wenhuach21 save memory
ded6d615
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
e1711393
wenhuach21 ugly workaround to save lm-head memory
7ad949b4
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
0cbd77e7
wenhuach21 fix one issue
294bc006
wenhuach21 fix one issue
051aa435
wenhuach21 set to true as a workaround for itrex
0527c66c
wenhuach21 set to low_gpu_mem_usage to false as a workaround for itrex
fac2640f
wenhuach21 force low_gpu_mem_usage to false as a workaround for itrex
b5e31dc1
wenhuach21 add comments
971a36ca
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
aa987da8
wenhuach21 fix bugs
3d3a60e2
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
2a481eaa
Kaihui-intel Support XPU export (#77)
9b16ae2a
wenhuach21 revert the change
b1456ef8
wenhuach21 fix cost time
926ed382
wenhuach21 fix the scale dtype, need to revert later
38d90ed3
WeiweiZhang1 aligning XPU's export configuration with INC (#79)
366f4ffd
wenhuach21
wenhuach21 commented on 2024-04-19
WeiweiZhang1 remove only_quantize_blocks, update xpu shell
87d1ad31
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
9a570009
WeiweiZhang1 fix critical bug in lm head tuning (#82)
aed6ce87
wenhuach21 fix some issues
2044d78a
wenhuach21 fix some issues
3dbb7b3f
wenhuach21 fix one issue
c2da84fb
wenhuach21 Merge branch 'main' into lm-head
779ccd2d
wenhuach21 add comment
6fc9331b
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
7a11231d
wenhuach21 wenhuach21 changed the title initial support for quantization of layers outside of blocks support lm head quantizaiton and export to Intel cpu 1 year ago
wenhuach21 fix preci issue
624c03a2
WeiweiZhang1 refine hpu memory check
45a2e973
wenhuach21 follow weight config to decide whether quantizing lm-head or not
32870f82
wenhuach21 Merge branch 'lm-head' of https://github.com/intel/auto-round into lm…
6bfb8ab4
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
cb744d46
wenhuach21 fix bug
6d4b9d13
wenhuach21 Merge branch 'lm-head' of https://github.com/intel/auto-round into lm…
a2147d76
wenhuach21 fix bug and add unit test
36ac0840
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
6a93a2ea
wenhuach21 fix ut
23cd7580
wenhuach21 Merge branch 'lm-head' of https://github.com/intel/auto-round into lm…
56cf41c6
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
9a9a1f17
wenhuach21 wenhuach21 merged 16f9b7bd into main 1 year ago
wenhuach21 wenhuach21 deleted the lm-head branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone