Support `AOPerModuleConfig` and `include_embedding` (#37802)
* Support `AOPerModuleConfig` and include_embedding
Summary:
This PR adds support for per-module quantization configuration in torchao via `AOPerModuleConfig`, together with an `include_embedding` flag so that the input embedding can be quantized as well.
It also adds per-module quantization examples (see the usage sketch after this list):
1. Quantizing different layers with different quantization configs
2. Skipping quantization for certain layers
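A minimal usage sketch under stated assumptions: the checkpoint, the module names, and the specific torchao configs (`Int4WeightOnlyConfig`, `Int8WeightOnlyConfig`, `IntxWeightOnlyConfig`) are illustrative; `"_default"` is taken to be the fallback key, `None` to mean "skip this module", and `include_embedding=True` to lift the input embedding out of the modules excluded from conversion.

```python
import torch
from transformers import AutoModelForCausalLM, TorchAoConfig
from torchao.quantization import (
    AOPerModuleConfig,
    Int4WeightOnlyConfig,
    Int8WeightOnlyConfig,
    IntxWeightOnlyConfig,
)
from torchao.quantization.granularity import PerAxis

# Map fully-qualified module names to quantization configs.
# "_default" is assumed to be the fallback for unmatched layers;
# None is assumed to skip quantization for that module.
per_module_config = AOPerModuleConfig(
    {
        "_default": Int4WeightOnlyConfig(group_size=128),
        "model.layers.0.self_attn.q_proj": Int8WeightOnlyConfig(),  # different config for this layer
        "model.layers.0.mlp.gate_proj": None,  # skip quantization for this layer
        "model.embed_tokens": IntxWeightOnlyConfig(
            weight_dtype=torch.int8, granularity=PerAxis(0)
        ),  # quantize the input embedding
    }
)

# include_embedding=True is assumed to remove the input embedding from the
# modules excluded from conversion, so the "model.embed_tokens" entry applies.
quantization_config = TorchAoConfig(quant_type=per_module_config, include_embedding=True)

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-0.5B",  # illustrative checkpoint; module names must match the architecture
    torch_dtype=torch.bfloat16,
    device_map="auto",
    quantization_config=quantization_config,
)
```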
Test Plan:
python tests/quantization/torchao_integration/test_torchao.py -k test_include_embedding
python tests/quantization/torchao_integration/test_torchao.py -k test_per_module_config_skip
Reviewers:
Subscribers:
Tasks:
Tags:
* format
* format
* include_embedding: remove the input embedding from the modules not to convert so it can be quantized
* more docs
* Update docs/source/en/quantization/torchao.md
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update src/transformers/quantizers/quantizer_torchao.py
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update src/transformers/quantizers/quantizer_torchao.py
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
---------
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>