auto-round
a52c81ef - Reduce RAM usage of quantizing VLM models and fix issues of quantizing gemma4

Commit
2 days ago
Reduce RAM usage of quantizing VLM models and fix issues of quantizing gemma4 Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Author
Committer
Parents
Loading