llama.cpp
d1259b7b - llama : do not quantize expert gating tensors

Commit

1 year ago

llama : do not quantize expert gating tensors

References

#4406 - llama : add Mixtral support

Author

ggerganov

ggerganov

Parents

Loading