llama.cpp
745cbcf2 - llama-quant : fix the verification of attention layers for encoder-decoder models (#16023)

Commit
131 days ago
llama-quant : fix the verification of attention layers for encoder-decoder models (#16023)

Signed-off-by: Jie Fu <jiefu@tencent.com>