vulkan: Use fp16 for the flash attention P*V multiplication #12783
vulkan: Use fp16 for the flash attention P*V multiplication
dab1f028
0cc4m
approved these changes
on 2025-04-09
0cc4m
merged
7ecd780b
into master 151 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub