auto-round
[GGUF] Use quant_nontext_module to control whether the vision model is quantized
#1317
Merged
