llama.cpp
llama-quant : fix the verification of attention layers for encoder-decoder models
#16023
Merged

llama-quant : fix the verification of attention layers for encoder-decoder models #16023

CISC merged 1 commit into ggml-org:master from DamonFool:llama-quant-t5
DamonFool
DamonFool llama-quant : fix the verification of attention layers for encoder-de…
22ccb6ad
CISC
CISC commented on 2025-09-16
DamonFool
CISC
CISC approved these changes on 2025-09-17
CISC CISC merged 745cbcf2 into master 268 days ago
DamonFool
DamonFool DamonFool deleted the llama-quant-t5 branch 268 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone