llama.cpp
gguf-py : add Numpy MXFP4 de/quantization support
#15111
Merged

compilade merged 2 commits into master from compilade/gguf-py-mxfp4
compilade gguf-py : add MXFP4 de/quantization support
141cab13
compilade added python
compilade added Tensor Encoding Scheme
compilade commented on 2025-08-06
CISC approved these changes on 2025-08-06
ngxson
compilade ggml-quants : handle zero amax for MXFP4
2763dc8b
github-actions added ggml
compilade
slaren approved these changes on 2025-08-07
compilade merged e54d41be into master 34 days ago
