llama.cpp
0f1bb602 - model : remove duplicate wo_s scale after build_attn (Qwen3, LLaMA) (#22421)

Commit
20 days ago
model : remove duplicate wo_s scale after build_attn (Qwen3, LLaMA) (#22421)

Signed-off-by: Yash Nankani <ynankani@nvidia.com>