auto-round
Align GGUF weight types with the original (output.weight, token_embed)
#900
Merged
