FP-Quant support (#38696)
* quartet
* quartet qat -> quartet
* format
* bf16 backward
* interfaces
* forward_method
* quartet -> fp_quant
* style
* List -> list
* list typing
* fixed format and annotations
* test_fp_quant
* docstrings and default dtypes
* better docstring and removed noop checks
* docs
* pseudoquantization support to test on non-blackwell
* pseudoquant
* Pseudoquant docs
* Update docs/source/en/quantization/fp_quant.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update docs/source/en/quantization/fp_quant.md
* Update docs/source/en/quantization/fp_quant.md
* Update src/transformers/utils/quantization_config.py
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update tests/quantization/fp_quant_integration/test_fp_quant.py
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update tests/quantization/fp_quant_integration/test_fp_quant.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* small test fixes
* dockerfile update
* spec link
* removed `_process_model_after_weight_loading`
* toctree
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>