auto-round
disable quantizing lm-head with tied weights as a workaround
#102
Merged

Loading