transformers
remove to restriction for 4-bit model
#33122
Merged

remove to restriction for 4-bit model #33122

SunMarc merged 5 commits into main from remove_to_4bit
SunMarc
SunMarc remove to restiction for 4-bit model
08f9c937
HuggingFaceDocBuilderDev
matthewdouglas
matthewdouglas
matthewdouglas commented on 2024-08-26
matthewdouglas
matthewdouglas matthewdouglas added Quantization
SunMarc Update src/transformers/modeling_utils.py
bb12e883
matthewdouglas bitsandbytes: prevent dtype casting while allowing device movement wi…
d064b48a
matthewdouglas
matthewdouglas commented on 2024-08-27
matthewdouglas quality fix
22f60881
matthewdouglas matthewdouglas marked this pull request as ready for review 1 year ago
matthewdouglas matthewdouglas requested a review from ArthurZucker ArthurZucker 1 year ago
matthewdouglas matthewdouglas requested a review from LysandreJik LysandreJik 1 year ago
LysandreJik
LysandreJik approved these changes on 2024-08-30
matthewdouglas Improve warning message for .to() and .cuda() on bnb quantized models
462ac2c3
ArthurZucker
ArthurZucker approved these changes on 2024-08-30
SunMarc SunMarc merged 9ea1eacd into main 1 year ago
SunMarc SunMarc deleted the remove_to_4bit branch 1 year ago
ukaprch

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone