Update quantization overview for XPU (#40331)
* update XPU quantization overview
* fix AQLM tests
* fix format
* update GGUF support
* fix GGUF tests
* fix XPU GGUF precision error
* replace deprecated models
* fix import org
* update XPU GGML tests
* revert wrong change
* fix XPU tests
* XPU optimum-quanto goes green
* fix format
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>