llama.cpp
llama : fix non-quantization of expert gating tensors
#5754
Merged

compilade: llama : fix non-quantization of expert gating tensors (969be5d4)
cebtenzzre approved these changes on 2024-02-27
ggerganov approved these changes on 2024-02-28
ggerganov merged adcb12a9 into master 1 year ago
