text-generation-inference
c6e8b944
- fix(server): fix quantization for sharded models (#45)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
fix(server): fix quantization for sharded models (#45)
References
#45 - fix(server): fix quantization for sharded models
Author
OlivierDehaene
Parents
017a2a8c
Loading