llama.cpp
d1259b7b - llama : do not quantize expert gating tensors

Commit
1 year ago
  • File
    llama.cpp