llama.cpp
ggml : add NVFP4 quantization type support
#19769
Merged

ggml : add NVFP4 quantization type support #19769

CISC merged 52 commits into ggml-org:master from richarddd:feat/nvfp4
richarddd
richarddd richarddd requested a review from ggerganov ggerganov 40 days ago
richarddd richarddd requested a review from CISC CISC 40 days ago
richarddd richarddd requested a review from 0cc4m 0cc4m 40 days ago
richarddd richarddd requested a review from JohannesGaessler JohannesGaessler 40 days ago
richarddd richarddd force pushed from 9cd0f586 to 86dd3fc6 40 days ago
github-actions github-actions added testing
github-actions github-actions added Nvidia GPU
github-actions github-actions added Vulkan
github-actions github-actions added python
github-actions github-actions added ggml
github-actions github-actions added Apple Metal
JohannesGaessler
pwilkin
jeffbolznv
JohannesGaessler
ggerganov
am17an
pwilkin
pwilkin
richarddd richarddd marked this pull request as draft 38 days ago
richarddd
am17an
richarddd
richarddd
am17an
richarddd richarddd force pushed from 5f8f21bb to ffab58b2 38 days ago
richarddd
github-actions github-actions added examples
richarddd richarddd marked this pull request as ready for review 37 days ago
am17an
am17an commented on 2026-02-25
am17an
am17an commented on 2026-02-25
richarddd
am17an
am17an commented on 2026-02-25
CISC
CISC commented on 2026-02-25
am17an
am17an commented on 2026-02-26
richarddd richarddd force pushed from 3cbb4e37 to fa669191 34 days ago
am17an
richarddd
am17an
pwilkin
pwilkin
richarddd
pwilkin
am17an
am17an commented on 2026-03-01
CISC
CISC commented on 2026-03-01
CISC
richarddd
CISC
CISC commented on 2026-03-01
ggerganov
richarddd richarddd changed the title WIP: ggml : add NVFP4 quantization type support ggml : add NVFP4 quantization type support 30 days ago
michaelw9999
am17an
michaelw9999
michaelw9999
am17an
richarddd
ggerganov
ggerganov commented on 2026-03-04
gocursor
github-actions github-actions added model
CISC
richarddd WIP: add NVFP4 quantization support
98bf995c
richarddd tests
7138a3cc
richarddd improve NVFP4 dot product implementation performance and fix bad sup…
91fd8f7b
richarddd typo
c55390f8
richarddd Use nvfp4 kvalues
8a3b35f4
richarddd vulkan : fix NVFP4 shader compilation by including kvalues_mxfp4 look…
9a3d804c
richarddd vulcal and perf fixes
ad18a561
richarddd wip
270eba7a
richarddd Fix metal
03df285f
richarddd fix vulcan
457ee2b7
richarddd Rename threshold & fix wrong scale
2d91e235
richarddd Fix MOE
9936919b
richarddd Shelf backend implementations (CUDA, Metal, Vulkan, arch-specific SIMD)
4303f97d
richarddd Fix arch-fallback.h: add NVFP4 generic fallback for all platforms
39a3734f
richarddd quantize: add NVFP4 as a quantization type option
256d0b1b
richarddd Fix ggml_fp32_to_ue4m3: handle subnormal values
ddc93e50
richarddd Restore ARM NEON NVFP4 dot product implementation
1d291020
richarddd Optimize ARM NEON NVFP4 dot product: LUT + vpaddq + vfmaq
0d015ef5
richarddd ARM NEON NVFP4: rearrange q8 to match nibble layout
525c76a4
richarddd CPU only backend 64 super-block layout
707e088d
richarddd cleanup
4e4275d0
richarddd Remove unused LUT
48632879
richarddd int
3c3e662a
richarddd exclude NVFP4 from unsupported ops in metal build
93bbcadf
richarddd remove quantization for now
9ceb0025
richarddd store scales as native UE4M3, preserve original model bits when possible
4bfb3188
richarddd Update convert_hf_to_gguf.py
3b4ebe5b
richarddd correct comment
ad9d68a6
richarddd format
0717bfc4
richarddd reduce duplication and cleanup
bd4b67ce
richarddd Address comments
a3b0c749
richarddd move detection to prepare_tensors
396b8241
richarddd Use math instead of const
c21e1df8
richarddd Move
5cc15166
richarddd fix comment
f2b6dce1
richarddd Shelf quantize tests
a567d3ed
richarddd Rebase and move check
677eedb0
richarddd cleanup
7dead730
richarddd lint
e400ac78
richarddd Update gguf-py/gguf/scripts/gguf_convert_endian.py
52b25d3e
richarddd cleanup
7dead730
richarddd lint
e400ac78
richarddd organize
b005f559
richarddd Refactor
db618aa7
richarddd richarddd force pushed from 93ab4d7a to db618aa7 24 days ago
richarddd
ggerganov
ggerganov approved these changes on 2026-03-09
ggerganov ggerganov requested a review from CISC CISC 24 days ago
CISC
CISC approved these changes on 2026-03-09
richarddd Update convert_hf_to_gguf.py
2b465a6f
richarddd Update convert_hf_to_gguf.py
27c28316
richarddd Update convert_hf_to_gguf.py
d2b9d373
CISC add quantize_nvfp4 (required for test_quants.py)
0870ec53
CISC add quantize_nvfp4 (required for test_quants.py)
238a9125
CISC add quantize_nvfp4 (required for test_quants.py)
215787e0
CISC
CISC fix return type
dab2f826
CISC
ggerganov
ORippler
ORippler dismissed these changes on 2026-03-10
michaelw9999
CISC
CISC
ORippler
ORippler
ORippler
CISC
richarddd Merge branch 'master' into feat/nvfp4
51f757c2
CISC CISC dismissed their stale review 21 days ago
https://github.com/ggml-org/llama.cpp/pull/19769#issuecomment-4040742527
CISC CISC merged 5eae9cb1 into master 21 days ago
ggerganov
CISC
ggerganov
CISC
ggerganov
CISC
richarddd richarddd deleted the feat/nvfp4 branch 21 days ago
am17an
JohannesGaessler
michaelw9999
richarddd
JohannesGaessler
xkmire
michaelw9999
vbooka1
CISC
ORippler
ORippler
JohannesGaessler
ORippler
michaelw9999
JohannesGaessler
CISC
michaelw9999
JohannesGaessler

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone