llama.cpp
d1259b7b
- llama : do not quantize expert gating tensors
Commit
1 year ago
llama : do not quantize expert gating tensors
References
#4406 - llama : add Mixtral support
Author
ggerganov
Parents
6cfb31f9
Files
1
llama.cpp