llama.cpp
558a7647 - Force FP32 compute in GLM4 FFN Down (#13101)

Commit
167 days ago
Force FP32 compute in GLM4 FFN Down (#13101) * Force FP32 compute in cuBLAS GEMM * Revert "Force FP32 compute in cuBLAS GEMM" This reverts commit 6efd872732159ab88ee7b3c1d77ba5ebc83079bd. * Force F32 compute in GLM4 ffn down * Edit comment to clarify issue Co-authored-by: Johannes Gäßler <johannesg@5d6.de> --------- Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
Author
Parents
Loading