transformers
Add HQQ quantization support
#29637
Merged

Add HQQ quantization support #29637

amyeroberts merged 78 commits into huggingface:main from stable
mobicham
amyeroberts
Minami-su
Minami-su approved these changes on 2024-03-29
SunMarc
SunMarc commented on 2024-03-29
SunMarc SunMarc requested a review from amyeroberts amyeroberts 2 years ago
younesbelkada younesbelkada requested a review from younesbelkada younesbelkada 2 years ago
younesbelkada
younesbelkada commented on 2024-04-11
rationalism
rationalism
rationalism
rationalism
rationalism
mobicham
rationalism
mobicham mobicham closed this 2 years ago
mobicham update HQQ transformers integration
bbc68fee
mobicham Merge branch 'huggingface:main' into stable
2a1f2245
mobicham mobicham reopened this 2 years ago
mobicham
mobicham push import_utils.py
e1e5df68
younesbelkada
younesbelkada commented on 2024-04-24
mobicham add force_hooks check in modeling_utils.py
0192b03b
mobicham fix | with Optional
823de372
mobicham force bias as param
08d7b8e6
mobicham check bias is Tensor
e1fa6c96
mobicham force forward for multi-gpu
6e854cae
younesbelkada
younesbelkada commented on 2024-04-25
mobicham review fixes pass
2b9f271a
younesbelkada
younesbelkada commented on 2024-04-25
SunMarc
SunMarc commented on 2024-04-25
mobicham remove torch grad()
5bb9ca25
mobicham if any key in linear_tags fix
392e7c5e
mobicham add cpu/disk check
20f9ad5b
mobicham isinstance return
3a5679a9
mobicham add multigpu test + refactor tests
7a1bbca2
mobicham clean hqq_utils imports in hqq.py
65b28879
mobicham clean hqq_utils imports in quantizer_hqq.py
bba74cd2
mobicham delete hqq_utils.py
de88c2af
mobicham Delete src/transformers/utils/hqq_utils.py
651a5863
mobicham ruff init
d07ea850
mobicham remove torch.float16 from __init__ in test
dedf69ec
mobicham refactor test
0edf8a43
mobicham isinstance -> type in quantizer_hqq.py
c7ec1239
younesbelkada
younesbelkada commented on 2024-04-26
SunMarc
SunMarc commented on 2024-04-26
mxjmtxrm
mobicham cpu/disk device_map check in quantizer_hqq.py
5283ac20
mobicham remove type(module) nn.linear check in quantizer_hqq.py
15daeb48
mobicham add BaseQuantizeConfig import inside HqqConfig init
bc4bc73e
mobicham remove hqq import in hqq.py
b54e87b2
mobicham remove accelerate import from test_hqq.py
0f9698af
mobicham quant config.py doc update
d31837fb
mobicham add hqqconfig to main_classes doc
b8f792c7
mobicham Merge branch 'huggingface:main' into stable
8b84cb1e
mobicham make style
9a061e56
mobicham __init__ fix
86122823
mobicham ruff __init__
b7867932
younesbelkada
younesbelkada commented on 2024-04-29
mobicham skip_modules list
e7ba7170
mobicham hqqconfig format fix
3a38f210
mobicham hqqconfig doc fix
9eee2131
mobicham hqqconfig doc fix
03cc8e6c
mobicham hqqconfig doc fix
96bd141b
mobicham hqqconfig doc fix
713d2261
mobicham hqqconfig doc fix
dad9a60d
mobicham hqqconfig doc fix
67c0985d
mobicham hqqconfig doc fix
94c393a8
mobicham hqqconfig doc fix
35fc9f50
mobicham hqqconfig doc fix
06f64978
HuggingFaceDocBuilderDev
younesbelkada
younesbelkada approved these changes on 2024-04-29
SunMarc
SunMarc approved these changes on 2024-04-29
mobicham test_hqq.py remove mistral comment
25fde9c7
mobicham remove self.using_multi_gpu is False
ee50516c
mobicham torch_dtype default val set and logger.info
01d798a4
amyeroberts
amyeroberts commented on 2024-04-03
mobicham hqq.py isinstance fix
a909ca8a
mobicham remove torch=None
c466c89a
mobicham torch_device test_hqq
d522fed9
mobicham rename test_hqq
a09e90ff
amyeroberts
amyeroberts commented on 2024-05-02
mobicham MODEL_ID in test_hqq
5bdf40f4
mobicham quantizer_hqq setattr fix
e693d473
mobicham quantizer_hqq typo fix
f5cabe58
mobicham imports quantizer_hqq.py
5ede086e
mobicham isinstance quantizer_hqq
c86000bc
mobicham hqq_layer.bias reformat quantizer_hqq
7d3e0839
mobicham Step 2 as comment in quantizer_hqq
082dfea5
mobicham prepare_for_hqq_linear() comment
667f1adb
mobicham keep_in_fp32_modules fix
e0cd7846
mobicham HqqHfQuantizer reformat
5d3b504e
mobicham quantization.md hqqconfig
cc1961cb
mobicham quantization.md model example reformat
9aa9e15a
mobicham quantization.md # space
9273e21d
mobicham quantization.md space })
f29e7a4e
mobicham quantization.md space })
5168852d
mobicham quantization_config fix doc
0dfe0806
mobicham axis value check in quantization_config
29340526
mobicham format
bc7cf4ee
mobicham dynamic config explanation
d33f944a
mobicham quant config method in quantization.md
3522f0a6
mobicham remove shard-level progress
cc14c211
mobicham .cuda fix modeling_utils
1e81036f
mobicham test_hqq fixes
ca07f5a3
mobicham Merge branch 'huggingface:main' into stable
4cc776e4
mobicham
mobicham make fix-copies
3d777ed7
younesbelkada
younesbelkada commented on 2024-05-02
mobicham Merge branch 'huggingface:main' into stable
b8088581
younesbelkada younesbelkada requested a review from amyeroberts amyeroberts 2 years ago
younesbelkada
mobicham Merge branch 'huggingface:main' into stable
5e711390
amyeroberts
amyeroberts approved these changes on 2024-05-02
amyeroberts
amyeroberts amyeroberts merged 59952994 into main 2 years ago
danielhanchen
mobicham
kadirnar
kadirnar
mobicham
kadirnar
mobicham
kadirnar
kadirnar
mobicham
appoose
kadirnar
appoose
kadirnar
appoose
huseinzol05
mobicham

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone