Modify device_map behavior when loading a model using from_pretrained (#23922)
* Modify device map behavior for 4/8 bits model
* Remove device_map arg for training 4/8 bit model
* Remove index
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Add Exceptions
* Modify comment
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix formatting
* Get current device with accelerate
* Revert "Get current device with accelerate"
This reverts commit 46f00799103bbe15bd58762ba029aab35363c4f7.
* Fix Exception
* Modify quantization doc
* Fix error
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>