llama.cpp
ggml-webgpu: Fix bug in FlashAttention support check
#22492
Merged

ggml-webgpu: Fix bug in FlashAttention support check #22492

reeselevine
reeselevine Fix flashattention support check for devices that don't support subgr…
70a4f4f6
reeselevine reeselevine requested a review 58 days ago
github-actions github-actions added ggml
github-actions github-actions added WebGPU
ArberSephirotheca
ArberSephirotheca commented on 2026-04-29
reeselevine set path to none if kv_tile doesn't fit
49db7ee5
ArberSephirotheca
ArberSephirotheca approved these changes on 2026-04-29
reeselevine reeselevine added merge ready
CISC
CISC approved these changes on 2026-04-29
ggerganov ggerganov merged d6a50940 into master 58 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone