transformers
d0b1d8d8
- Skip DeepSpeed ZeRO Stage 3 model initialization when bnb (#34395)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
Skip DeepSpeed ZeRO Stage 3 model initialization when bnb (#34395) * Skip DeepSpeed ZeRO Stage 3 model initialization when it is intended to be quantized. * Propagate the quantization state using a context manager * make fixup
References
#34395 - Skip DeepSpeed ZeRO Stage 3 model initialization when bnb
Author
eljandoubi
Parents
eb811449
Loading