llama.cpp
4fcd87cf
- gguf-py : skip endian-conversion of MXFP4 data (#17523)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
18 days ago
gguf-py : skip endian-conversion of MXFP4 data (#17523) * gguf_convert_endian.py: skip MXFP4 data * Use gguf.constants.GGML_QUANT_SIZES to determine block sizes
References
#17523 - gguf_convert_endian.py: skip MXFP4 data
Author
AlekseiNikiforovIBM
Parents
b78db3bd
Loading