transformers
f9b9a5e8 - Update quantization overview for XPU (#40331)

Commit

247 days ago

Update quantization overview for XPU (#40331) * update xpu quantization overview Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix aqlm tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update gguf support Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix gguf tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix xpu gguf precision error Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * replace deprecated models Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix import org Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update xpu ggml tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert wrong change Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix xpu tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * xpu optimum-quanto goes green Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

References

#40331 - Update quantization overview for XPU

Author

jiqing-feng

Parents

b824f498

transformers f9b9a5e8 - Update quantization overview for XPU (#40331)

transformers
f9b9a5e8 - Update quantization overview for XPU (#40331)