llama.cpp
0f1bb602
- model : remove duplicate wo_s scale after build_attn (Qwen3, LLaMA) (#22421)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
20 days ago
model : remove duplicate wo_s scale after build_attn (Qwen3, LLaMA) (#22421) Signed-off-by: Yash Nankani <ynankani@nvidia.com>
References
#22421 - fix(graph): remove duplicate wo_s scale after build_attn (Qwen3, LLaMA)
Author
ynankani
Parents
d13540be
Loading