text-generation-inference 6bf7090e - fix per-column quantization
Commit
2 years ago
fix per-column quantization
References
#666 - feat(server): Add exllama GPTQ CUDA kernel support #553
Author
fxmarty
Parents
edfbfdfb