llama.cpp
llama : fix non-quantization of expert gating tensors
#5754

Merged

llama : fix non-quantization of expert gating tensors #5754

ggerganov merged 1 commit into ggml-org:master from compilade:fix-no-quantize-ffn-gate-inp

llama : fix non-quantization of expert gating tensors

969be5d4

cebtenzzre approved these changes on 2024-02-27

ggerganov approved these changes on 2024-02-28

ggerganov merged adcb12a9 into master 2 years ago

Reviewers

ggerganov

cebtenzzre

Assignees

No one assigned

Labels

None yet

Milestone

No milestone