Add offload for 8-bit model (#1699)
* Add offload for 8-bit model
* fix offload for saved 8-bit models and add tests
* Update src/accelerate/utils/modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/accelerate/utils/modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add doc on how offload works
* remove enable_offload
* make style doc
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>