Vulkan: Support fp32 accumulator in quantized matmul to fix GLM4-32B incoherence #13607
Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 …
35d675d8
0cc4m
marked this pull request as ready for review 212 days ago
0cc4m
merged
8960efd0
into master 212 days ago
0cc4m
deleted the 0cc4m/vulkan-mmq-fp32-acc-glm4 branch 212 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub