text-generation-inference
16d0fb04
Commit
2 years ago
Santacoder GPTQ support (quantized model seems awful, not sure if it's prompting or the quantization itself).
References
#438 - Inference support for GPTQ (llama + falcon tested) + Quantization script
Author
Narsil
Parents
983c813f