transformers
884a8ea1 - Improve model loading for compressed tensor models (#36152)

Commit
1 year ago
Improve model loading for compressed tensor models (#36152) * Disable warnings for stacked compressors * Introduce two new hooks in HfQuantizer lifecycle to allow updates to missing and unexpected keys * Update missing and unexpected keys for stacked compressors * Add tests * Fix: run_compressed cases * Fix: uncompressed cases * Rename compressed_tensor folder to compressed_tensors Move RunCompressedTest to the same file Update tests to unittest
Author
Parents
Loading