ggml ff54fda8 - gpt-2 : loading Q4_0 quantized model
Commit: gpt-2 : loading Q4_0 quantized model (3 years ago)
References: #27 - 4-bit Integer quantisation
Author: ggerganov
Committer: ggerganov
Parents: 21514b72