llama.cpp
llama:use F32 precision in GLM4 attention and no FA
#9130
Merged


piDack committed fix glm GGG err (8a6ba03c)
piDack changed the title from fix glm GGG err to Fix glm4 GGG err 1 year ago
piDack changed the title from Fix glm4 GGG err to llama:use F32 precision in GLM4 attention and no FA 1 year ago
ggerganov approved these changes on 2024-08-23
ggerganov merged a07c32ea into master 1 year ago
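The change this PR describes, forcing F32 precision in the GLM4 attention path, addresses the kind of numerical failure where half-precision attention scores overflow and the model emits garbage tokens. A minimal NumPy sketch (with hypothetical values, not actual GLM4 activations) of why FP16 can break in the Q·K^T step:

```python
import numpy as np

# Attention scores from Q·K^T grow with head dimension; once they pass the
# FP16 maximum (65504) they become inf, softmax turns to NaN, and decoding
# produces garbage output. Values below are illustrative only.
q = np.full((1, 256), 16.0, dtype=np.float16)  # hypothetical query row
k = np.full((256, 1), 16.0, dtype=np.float16)  # hypothetical key column

scores_f16 = q @ k  # 16*16*256 = 65536 > 65504, overflows in float16
scores_f32 = q.astype(np.float32) @ k.astype(np.float32)  # exact in float32

print(np.isinf(scores_f16).any())  # True: the FP16 score overflowed to inf
print(float(scores_f32[0, 0]))     # 65536.0: the F32 score is exact
```

Keeping the KQ matrix multiplication in F32, as the PR title states, avoids this failure mode at the cost of some extra compute in the attention layers.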
