Fix GLM4 incoherence with fp16 accumulators #13639
Set GLM4 blk.*.attn_output.weight, kqv_out-* matmul to GGML_PREC_F32 …
adefa985
ggerganov
approved these changes
on 2025-05-20
0cc4m
merged
c9c64dee
into master 1 year ago
0cc4m
deleted the 0cc4m/fix-vulkan-glm4 branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub