Support loading for static quant weight fp8 act fp8 #730
load w8a8
bb947822
refactor
9bef8263
add ut
b30a126f
remove example
eaad3a6e
fix typo
c411ca5f
Merge branch 'main' into wfp8-afp8
98023136
Update auto_round/export/export_to_autoround/export_to_fp8_woq.py
6597d5ca
Update export_to_fp8_woq.py
9b0f32ff
Merge branch 'main' into wfp8-afp8
c32daa6f
yiliu30
marked this pull request as ready for review 143 days ago
megre main
c136339c
update shape
5ebca24b
refactor
03cb2171
Merge branch 'main' into wfp8-afp8
e7280f69
tmp add bk
66388e53
refactor code
17ddd2d0
refine code
808449d7
yiliu30
changed the title Support loading for static quant weight fp8 act fp8 [WIP]Support loading for static quant weight fp8 act fp8 128 days ago
fix device list
f74ed6f6
fix
632cf8a9
refactor code
5b8b29d4
fix
57b4c199
update
bdf5f3e5
fix ut
ce3384f3
Merge branch 'main' into wfp8-afp8
7cea90ed
correct
22d11de1
clean
90826139
Merge branch 'wfp8-afp8' of https://github.com/intel/auto-round into …
6503355b
Merge branch 'main' into wfp8-afp8
b6876334
yiliu30
changed the title [WIP]Support loading for static quant weight fp8 act fp8 Support loading for static quant weight fp8 act fp8 127 days ago
fix shape
2202856f
Merge branch 'wfp8-afp8' of https://github.com/intel/auto-round into …
10f5753b
merge with main
cc42e47f
fix check
d0b99a8f
clean code
31845d0d
yiliu30
changed the title Support loading for static quant weight fp8 act fp8 [WIP]Support loading for static quant weight fp8 act fp8 122 days ago
yiliu30
marked this pull request as draft 122 days ago
merge
fdecddec
fix backend check
1f2e6749
Merge branch 'main' into wfp8-afp8
b56ad253
update config
4cec318f
revert change
6b2962fd
fix
638718e7
yiliu30
marked this pull request as ready for review 120 days ago
yiliu30
changed the title [WIP]Support loading for static quant weight fp8 act fp8 Support loading for static quant weight fp8 act fp8 120 days ago
fix
4df3e8f7
update
e01603ce
propagate the config
0cdf28b1
pass config to checker
27910da0
add more check
d46acdb2
refine code
fd057993
fix equal check
3d75c276
fix equal check
e0c0d58e
Merge branch 'wfp8-afp8' of https://github.com/intel/auto-round into …
75f2928c
fix get
fa3ec2dd
rename
ad5269e0
update check
35e45ed0
add warning
f4e254ba
Merge branch 'main' into wfp8-afp8
7cba242e
rename check
ff5a1e99
Merge branch 'main' into wfp8-afp8
b98f3db9
rename
50968fd0
Merge branch 'main' into wfp8-afp8
586d6a2e
Merge branch 'main' into wfp8-afp8
5e84ff98
[pre-commit.ci] auto fixes from pre-commit.com hooks
abd83acf
Merge branch 'main' into wfp8-afp8
9e2c63f3
[pre-commit.ci] auto fixes from pre-commit.com hooks
d332a957
Merge branch 'main' into wfp8-afp8
94508e39
[pre-commit.ci] auto fixes from pre-commit.com hooks
8a4a5334
fix
f05e38ba
Merge branch 'main' into wfp8-afp8
d759ca3f
update
c58a61c6
Merge branch 'main' into wfp8-afp8
c89ffc05
Merge branch 'main' into wfp8-afp8
04ae0fdc
fix
2c34244c
Merge branch 'wfp8-afp8' of https://github.com/intel/auto-round into …
b3a09107
yiliu30
merged
09e4d312
into main 114 days ago
yiliu30
deleted the wfp8-afp8 branch 114 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub