llama.cpp
Add AWQ (Activation-aware Weight Quantization) for llama, llama2, mpt, and mistral models
#4593
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
34
Changes
View On
GitHub
Loading