text-generation-inference
f90c61a3
- support bits different than 4
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
support bits different than 4
References
#666 - feat(server): Add exllama GPTQ CUDA kernel support #553
Author
fxmarty
Parents
67d68760
Loading