llama.cpp
a4d2d4ae - convert : add compressed-tensors NVFP4 support (#21095)

Commit
3 days ago
convert : add compressed-tensors NVFP4 support (#21095) * Refactored Compressed Tensors NVFP4 support for new base.py * Support compressed-tensors NVFP4 conversion * Moved Qwen MTP remap into filter_tensors * simplify * pathlib no longer used --------- Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
Author
Parents
Loading