llama.cpp
93b9baee
- convert-hf : reduce stacked MoE conversion RAM usage by a third
1 year ago
Author: compilade
Parents: 6f1b6360