llama.cpp
58b515ca
- convert-hf : add --outtype auto-f16
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
1 year ago
convert-hf : add --outtype auto-f16 A reason for this to exist is for model quantizers who want an initial GGUF with the most fidelity to the original model while still using a 16-bit float type instead of 32-bit floats.
References
#7158 - convert-hf : support bfloat16 conversion
Author
compilade
Committer
compilade
Parents
95930da3
Files
2
convert-hf-to-gguf.py
gguf-py/gguf
constants.py
Loading