llama.cpp
fix(quantize): add NVFP4 default type mapping and scale tensors
#22897
Open


t-timms wants to merge 2 commits into ggml-org:master from t-timms:fix/nvfp4-quantizer-scales
t-timms fix(quantize): add NVFP4 default type mapping and scale tensors
56f62c48
t-timms perf(quantize): compute optimal NVFP4 .scale via MSE minimization
f5df5181
t-timms requested a review from ggerganov 39 days ago
github-actions added the examples label

