auto-round
API support for FP8 models, and MLLM API support for loading from str
#752
Merged