text-generation-inference
732da694
- Remove lots of dead code, move triton to hard requirement
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Remove lots of dead code, move triton to hard requirement - Added option to upload to hub directly after quantizing.
References
#438 - Inference support for GPTQ (llama + falcon tested) + Quantization script
Author
Narsil
Parents
5de68637
Loading