llama.cpp
58b515ca - convert-hf : add --outtype auto-f16

Committed 1 year ago
convert-hf : add --outtype auto-f16

A reason for this option to exist is for model quantizers who want an initial GGUF with the highest fidelity to the original model while still using a 16-bit float type instead of 32-bit floats.
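As a rough illustration of why an "auto" 16-bit type can preserve more fidelity than forcing a single format: float16 offers more mantissa precision but a much narrower exponent range than bfloat16, so a per-tensor choice can use float16 where values fit and fall back to bfloat16 where they would overflow. The sketch below is hypothetical and not llama.cpp's actual selection logic; the function name and the decision rule are illustrative assumptions only.

```python
import numpy as np

def pick_16bit_dtype(tensor: np.ndarray) -> str:
    """Illustrative sketch (NOT llama.cpp's actual logic): pick a
    16-bit float type for a tensor.

    float16 has more mantissa bits (better precision) but a narrower
    exponent range than bfloat16, so fall back to bfloat16 when the
    tensor's magnitude would overflow float16 (~65504).
    """
    f32 = tensor.astype(np.float32)
    if np.isfinite(f32).all() and np.abs(f32).max() <= np.finfo(np.float16).max:
        return "F16"
    return "BF16"

# Typical weight values fit comfortably in float16...
print(pick_16bit_dtype(np.array([0.5, -1.25, 3.0])))  # F16
# ...but very large values exceed float16's range.
print(pick_16bit_dtype(np.array([1e10, 2.0])))        # BF16
```

A real converter would also have to account for precision loss (not just range) and for tensors that quantization-sensitive code paths expect in a specific type.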
Files changed:
  • convert-hf-to-gguf.py
  • gguf-py/gguf/constants.py