llama.cpp
ggml-webgpu: enable FLASH_ATTN_EXT on browser without subgroup matrix
#22199
Merged
