llama.cpp
141cab13
- gguf-py : add MXFP4 de/quantization support
Commit
65 days ago
gguf-py : add MXFP4 de/quantization support
References
#15111 - gguf-py : add Numpy MXFP4 de/quantization support
Author
compilade
Parents
fd1234cb