transformers
HFQuantizer implementation for compressed-tensors library
#31704
Merged

HFQuantizer implementation for compressed-tensors library #31704

bfineran
Add compressed-tensors HFQuantizer implementation
d695ec3e
flag serializable as False
f4689647
run
41224d3d
revive lines deleted by ruff
b61bfb96
fixes to load+save from sparseml, edit config to quantization_config,…
ff8f1c5a
address satrat comment
c1cb55de
compressed_tensors to compressed-tensors and revert back is_serializable
ef9d3f17
rename quant_method from sparseml to compressed-tensors
117d0504
tests
1901c3e5
edit tests
3ca270df
clean up tests
9a14b092
make style
ec59052d
cleanup
520ded87
cleanup
7dec8fc8
SunMarc
bfineran
bfineran Merge branch 'main' into compressed-tensors-quantizer
afb550da
add test skip for when compressed tensors is not installed
d9b36601
remove pydantic import + style
e51ac594
delay torch import in test
ccb54423
initial docs
bfd9220b
update main init for compressed tensors config
71a80f92
make fix-copies
547f9cce
docstring
8acbc090
remove fill_docstring
eaa5f20b
SunMarc
SunMarc commented on 2024-07-31
bfineran
bfineran commented on 2024-08-06
bfineran Apply suggestions from code review
4ba75fbc
review comments
94ea0d3c
review comments
c48840d0
bfineran Merge branch 'main' into compressed-tensors-quantizer
ab74d26c
mgoin
mgoin commented on 2024-08-19
comments - suppress warnings on state dict load, tests, fixes
2ecf7110
bfineran
bug-fix - remove unnecessary call to apply quant lifecycle
e1ae5049
run_compressed compatability
ea9e927c
SunMarc
SunMarc commented on 2024-09-02
SunMarc
SunMarc commented on 2024-09-02
SunMarc
SunMarc commented on 2024-09-02
revert changes not needed for compression
1c3ad5c9
no longer need unexpected keys fn
aa1a4f97
unexpected keys not needed either
81a13dd7
SunMarc
SunMarc
SunMarc approved these changes on 2024-09-05
SunMarc SunMarc requested a review from ArthurZucker ArthurZucker 1 year ago
Satrat Apply suggestions from code review
f53d7b99
add to_diff_dict
d8f7073c
update docs and expand testing
c4fbf70f
SunMarc
SunMarc
jvlinsta
Merge remote-tracking branch 'upstream/main' into compressed-tensors-…
1992a887
Satrat
Satrat Update _toctree.yml with compressed-tensors
298a6387
hyaticua
HuggingFaceDocBuilderDev
ArthurZucker
ArthurZucker approved these changes on 2024-09-21
Satrat Update src/transformers/utils/quantization_config.py
3cb44153
dsikka Merge branch 'main' into compressed-tensors-quantizer
a9431576
dsikka update doc
64f475ad
dsikka add note about saving a loaded model
fabe8a31
dsikka
SunMarc
SunMarc approved these changes on 2024-09-25
SunMarc SunMarc merged 574a9e12 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone