llama.cpp
convert : handle compressed-tensors quant method
#17069
Merged

convert : handle compressed-tensors quant method #17069

compilade
compilade convert : handle compressed-tensors quant method
33dba6ce
compilade convert : handle int-quantized models
d23bdd57
compilade convert : handle naive-quantized models
33dcb44a
compilade gguf-py : __pos__ is also unary
987862ad
compilade convert : fix flake8 lint
3770d941
compilade convert : use F32 for dequant of pack-quantized tensors
128118fd
compilade compilade requested a review from CISC CISC 102 days ago
github-actions github-actions added python
compilade compilade added enhancement
ubergarm
ubergarm
ggerganov ggerganov requested a review from ngxson ngxson 101 days ago
CISC
CISC approved these changes on 2025-11-07
ngxson
ngxson approved these changes on 2025-11-07
jukofyork
ngxson
jukofyork
jukofyork
ngxson
csabakecskemeti
jukofyork
compilade
CISC
compilade compilade merged 1c07c0c6 into master 99 days ago
jukofyork

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone