Remove lots of dead code, move triton to hard requirement
732da694
Triton is actually a dependency of torch on linux.
054a3d09
Narsilmarked this pull request as ready for review 2 years ago
Narsil
changed the title [WIP] Inference support for GPTQ (llama at least) Inference support for GPTQ (llama + falcon tested) + Quantization script2 years ago
Typo.
983c813f
Santacoder GPTQ support (quantized model seems awful, not sure if it's
Login to write a write a comment.
Login via GitHub