transformers
9c9fe89f - [run_clm example] add torch_dtype option for model load. (#20971)

Commit

3 years ago

[run_clm example] add torch_dtype option for model load. (#20971) * [run_clm example] add torch_dtype option for model load. for BLOOM 175B model. peak memory will reduce about 350G for inference. the weight of BLOOM in model hub is bfloat16 Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * add other type in option * fix style Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

References

#20971 - [run_clm example] add torch_dtype option for model load.

#27720 - Add common processor tests

#29969 - [SigLIP] Add fast tokenizer

#32831 - [Docs] Update resources

#33111 - [Backbone] Remove out_features everywhere

#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs

#39821 - Support MetaCLIP 2

#59 - Fix attention mask handling in EoMT-DINOv3 converter

#41212 - Add EoMT with DINOv3 backbone

#62 - Add initial DEIMv2 model implementation

Author

sywangyi

Parents

e697c912

transformers 9c9fe89f - [run_clm example] add torch_dtype option for model load. (#20971)

transformers
9c9fe89f - [run_clm example] add torch_dtype option for model load. (#20971)