transformers
6bf6f849 - [`Mxfp4`] Add a way to save with a quantization method (#40176)

Commit

278 days ago

[`Mxfp4`] Add a way to save with a quantization method (#40176)

* add a test
* tempdir
* fix import issue[
* wow I am tired
* properly init
* i am not super familiar with quantizer api :|
* set to TRUE fro now
* full support
* push current changes
* will clean this later but the imports are a shitshow here
* this correctly saves the block and scales but forward seems broken
* quanitze was not correct
* fix storage
* why were bias even included
* finally!
* style
* fix style
* remove print
* lazy import
* up
* not sure what happens this works now?
* holy molly it was not so far
* okay this seems to work!
* workings!!!
* allow save_pretrained to create PR
* Apply suggestions from code review
* fixup
* add deqyabtze fakse as wek
* working new
* fix
* rm swizzle and unswizzle during saving
* rm print
* Update src/transformers/modeling_utils.py
* fix
* style

---------

Co-authored-by: Marc Sun <marc@huggingface.co>

References

#40176 - [`Mxfp4`] Add a way to save with a quantization method

Author

ArthurZucker

Parents

04c2bae3
