llama.cpp
f8f071fa
- convert : handle pre-quantized models (#14810)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
64 days ago
convert : handle pre-quantized models (#14810) * convert : begin handling pre-quantized models * convert : fix conversion from FP8 for Deepseek-V3.1-Base
References
#14810 - convert : handle pre-quantized models
Author
compilade
Parents
0bf47a1d
Loading