auto-round
Simulated W4Afp8 Quantization
#331
Merged

Simulated W4Afp8 Quantization #331

wenhuach21 merged 35 commits into main from fp8
wenhuach21
wenhuach21 try to support fp8
23e6ff42
wenhuach21 add files
fb17014f
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
35cd3762
wenhuach21 Merge branch 'main' into fp8
6bb17160
wenhuach21 tiny change
97d1237c
wenhuach21 fix
6d0c5828
wenhuach21 fix nan issue and change to dynamic per token
fe41d5af
wenhuach21 Merge branch 'main' into fp8
53ccc1c2
wenhuach21 Merge branch 'fp8' of https://github.com/intel/auto-round into fp8
cf6f7199
wenhuach21 support static quantization, the code is ugly
fe385c3f
wenhuach21 fix
73ebb24f
wenhuach21 refine a little
2b647808
wenhuach21 update a little
237f886d
wenhuach21 refine code, fp16 model are easily gen NAN grad, need to have a study
bd8fea4e
wenhuach21 Merge branch 'main' into fp8
cba9bd20
wenhuach21 tmp change
d9074475
wenhuach21 fix a critic bug
23d96043
wenhuach21 refine code
b5902596
wenhuach21 wenhuach21 added draft
wenhuach21 wenhuach21 marked this pull request as draft 1 year ago
wenhuach21 merge conv1d and fix conv1d exporting issue
8ba1137d
wenhuach21 Merge branch 'main' into fp8
dcaee165
wenhuach21 tmp change
bebe8f93
wenhuach21 Merge branch 'main' into fp8
41a7eab9
wenhuach21 fix issue
9c111878
wenhuach21 update
ac44576d
wenhuach21 wenhuach21 changed the title [WIP]try to support fp8 Simulated W4Afp8 Quantization 1 year ago
wenhuach21 wenhuach21 marked this pull request as ready for review 1 year ago
wenhuach21 Merge branch 'main' into fp8
0c98a2e6
wenhuach21 refine a little
2c190ae4
wenhuach21 Merge branch 'fp8' of https://github.com/intel/auto-round into fp8
2e095b8d
wenhuach21 fix preci issue
b0683228
wenhuach21 fix preci issue
80a74ae6
wenhuach21 wenhuach21 removed draft
wenhuach21 remove debug code
07acfd69
wenhuach21 try to fix ut
41ad46cc
wenhuach21 Merge branch 'main' into fp8
cd18f37c
wenhuach21 wenhuach21 requested a review from yiliu30 yiliu30 1 year ago
wenhuach21 wenhuach21 requested a review from WeiweiZhang1 WeiweiZhang1 1 year ago
wenhuach21 wenhuach21 requested a review from n1ck-guo n1ck-guo 1 year ago
WeiweiZhang1
WeiweiZhang1 approved these changes on 2024-11-28
n1ck-guo
n1ck-guo approved these changes on 2024-11-28
yiliu30 fix numba pack
5a94ac6b
wenhuach21 Merge branch 'main' into fp8
2ad66ca0
wenhuach21 fix comment
1608c978
wenhuach21 wenhuach21 merged a98175ff into main 1 year ago
wenhuach21 wenhuach21 deleted the fp8 branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone