auto-round
support real lm-head quantization and mixed precision inference
#114
Merged

Loading