llama.cpp
llama : fix non-quantization of expert gating tensors
#5754
Merged
ggerganov merged 1 commit into ggml-org:master from compilade:fix-no-quantize-ffn-gate-inp
969be5d4 llama : fix non-quantization of expert gating tensors
cebtenzzre approved these changes on 2024-02-27
ggerganov approved these changes on 2024-02-28
ggerganov merged adcb12a9 into master 1 year ago
Reviewers: ggerganov, cebtenzzre
Assignees: No one assigned
Labels: None yet
Milestone: No milestone