llama.cpp
Force NVFP4 W4A8 path for NVFP4_W4A16 layers on Blackwell, where NVFP4 normally uses the native W4A4 path.
#24364
Open

Force NVFP4 W4A8 path for NVFP4_W4A16 layers on Blackwell, where NVFP4 normally uses the native W4A4 path. #24364

ynankani
ynankani ynankani requested a review from ggerganov ggerganov 12 days ago
ynankani ynankani requested a review from CISC CISC 12 days ago
ynankani ynankani requested a review 12 days ago
github-actions github-actions added testing
github-actions github-actions added Nvidia GPU
github-actions github-actions added python
github-actions github-actions added ggml
sanmai
sanmai commented on 2026-06-10
am17an
am17an commented on 2026-06-10
ORippler
ynankani Force NVFP4 W4A8 path for NVFP4_W4A16 layers
dfee78d3
ynankani Add a Knob to allow W4A4 for user, even if checkpoint specifies W4A16…
18f1df39
ynankani ynankani force pushed from b72a8c94 to 18f1df39 4 days ago
github-actions github-actions added documentation
github-actions github-actions added CUDA

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone