llama.cpp 128118fd - convert : use F32 for dequant of pack-quantized tensors
Commit
54 days ago
convert : use F32 for dequant of pack-quantized tensors
References
compilade/convert-prequant-compressed-tensors
#17069 - convert : handle compressed-tensors quant method
Author
compilade
Parents
3770d941