llama.cpp
a4837577 - vulkan: use aligned loads for flash attention mask (#12853)

Commit
249 days ago
vulkan: use aligned loads for flash attention mask (#12853) Rewrite the stride logic for the mask tensor in the FA shader to force the stride to be aligned, to allow using more efficient loads.
Author
Parents
Loading