auto-round
support real lm-head quantization and mixed precision inference
#114
Merged

support real lm-head quantization and mixed precision inference #114

wenhuach21 merged 52 commits into main from autoroundmodel
wenhuach21
wenhuach21 fix a bug in example
7d020db2
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
fbe69d5c
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
596a18f5
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
10add8ce
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
003b60a6
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
9d495140
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
3b7f386c
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
d3f14df6
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
76e4d909
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
08e46acb
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
15b756b0
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
40425bbd
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
f0b9ad00
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
04e70eca
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
43811bb9
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
c8817199
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
54920e56
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
e2c2f56c
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
4f718d45
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
022988aa
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
b4eb6790
wenhuach21 Merge branch 'main' of https://github.com/intel/auto-round
0abbde8e
wenhuach21 fix gradient_accmulate bug in lm-head
aed8be0d
wenhuach21 init update
a129f011
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
892cd8e7
wenhuach21 fix bias zero and Non key mismatch, but autogptq way is not optimal
48fd07ee
wenhuach21 Merge branch 'autoroundmodel' of https://github.com/intel/auto-round …
78b3fd10
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
70960a8f
wenhuach21 wip
1deda104
wenhuach21 Merge branch 'autoroundmodel' of https://github.com/intel/auto-round …
fcb28573
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
208ea37c
wenhuach21 basically finished the exporting part, the polish of loading code is …
72dba51e
wenhuach21 fix conflict
e791d863
wenhuach21 fix typo
ec28b729
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
f99b490e
wenhuach21 basically finish the loading part
31e4bd56
wenhuach21 fix conflict
6adf327d
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
2ee273b3
wenhuach21 fix some issues
03dd7f99
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
ad2f3def
wenhuach21 support lm eval for quantized lm-head model
8f2aa970
wenhuach21 Merge branch 'autoroundmodel' of https://github.com/intel/auto-round …
5b26ccd7
wenhuach21 force lm head to use gradient accumulate
b1229939
wenhuach21 revert debug code
4e3e9716
wenhuach21 tiny change
5f13bb0c
wenhuach21 Merge branch 'main' into autoroundmodel
c8757264
wenhuach21 Merge branch 'autoroundmodel' of https://github.com/intel/auto-round …
33d84537
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
e5fc181b
wenhuach21 fix line too long issue
209e1d04
wenhuach21 Merge branch 'autoroundmodel' of https://github.com/intel/auto-round …
83424dec
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
6a74a7df
wenhuach21 wenhuach21 changed the title [WIP] support real lm-head quantization and mixed precision inference support real lm-head quantization and mixed precision inference 1 year ago
wenhuach21 add mistral v0.1 lm-head acc and model
59ca3596
wenhuach21 wenhuach21 merged 4d1caebb into main 1 year ago
wenhuach21 wenhuach21 deleted the autoroundmodel branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone