transformers
49067bc9 - Add Four Over Six quantization integration (#43970)

Commit
39 days ago
Add Four Over Six quantization integration (#43970) * Add fouroversix integration and docs * Small fix * Update quantized modules outside of FourOverSixQuantize * Delete high-precision parameters during conversion and before weight loading * Update year Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * Replace FourOverSixLinear check with QuantizedModule check * Run fix-repo * Update fouroversix * Update check_docstrings * Fix keep_master_weights * Update get_quantized_parameters API * Apply repo consistency fixes * Check whether each parameter in a module should be quantized * Update FourOverSixConfig * Fix docstring --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Author
Parents
Loading