llama.cpp
d7f794ea
- convert : avoid dequantizing mxfp4 for GPT-OSS
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
120 days ago
convert : avoid dequantizing mxfp4 for GPT-OSS
References
compilade/fix-prequant-mxfp4-gpt-oss
#16756 - convert : avoid dequantizing mxfp4 for GPT-OSS
Author
compilade
Committer
compilade
Parents
5a91109a
Loading