llama.cpp
93b9baee - convert-hf : reduce stacked MoE conversion RAM usage by a third

Commit

1 year ago

convert-hf : reduce stacked MoE conversion RAM usage by a third

Author

compilade
