transformers
d0b1d8d8 - Skip DeepSpeed ZeRO Stage 3 model initialization when bnb (#34395)

Commit
1 year ago
Skip DeepSpeed ZeRO Stage 3 model initialization when bnb (#34395) * Skip DeepSpeed ZeRO Stage 3 model initialization when it is intended to be quantized. * Propagate the quantization state using a context manager * make fixup
Author
Parents
Loading