metal : use F32 attention accumulators in FA kernels #13975
metal : use F32 accumulators in FA kernels
21be70ec
ggerganov
merged
ea394d7a
into master 107 days ago
ggerganov
deleted the gg/metal-fa-acc-f32 branch 107 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub