llama.cpp
ggml-webgpu: parameterize submission size and add iOS specific limits
#21533
Merged

ggml-webgpu: parameterize submission size and add iOS specific limits #21533

reeselevine
reeselevine Work towards removing bitcast
f1eb80ef
reeselevine Move rest of existing types over
e9af481e
reeselevine Add timeout back to wait and remove synchronous set_tensor/memset_tensor
b3aa3be8
reeselevine move to unpackf16 for wider compatibility
67fe0897
reeselevine cleanup
e85e8bcc
reeselevine Remove deadlock condition in free_bufs
32ee70a2
reeselevine Merge remote-tracking branch 'upstream/master' into remove_bitcast
309ef1f7
reeselevine Start work on removing parameter buffer pools
1fc8b64a
reeselevine Simplify and optimize further
9592ed56
reeselevine Merge remote-tracking branch 'upstream/master' into one-buffer
82008f32
reeselevine simplify profile futures
d307d47c
reeselevine Fix stride
a2c1d910
reeselevine Try using a single command buffer per batch
e8ea0441
reeselevine Merge remote-tracking branch 'upstream/master' into one-buffer
e954542c
reeselevine formatting
7450dd01
reeselevine Add parameters for different browsers in-flight submissions
cc6f8089
reeselevine Update handling of batch size too
d63c7395
reeselevine Throttle ios as much as possible
f9d63dc5
reeselevine Merge remote-tracking branch 'upstream/master' into browser-params
29ba9c73
reeselevine reeselevine requested a review 5 days ago
github-actions github-actions added ggml
github-actions github-actions added WebGPU
reeselevine Increase timeout for llvm-pipe testing
5b0fd3d2
reeselevine reeselevine added merge ready
nikhilJain17
nikhilJain17 approved these changes on 2026-04-07
ggerganov ggerganov merged 957d717c into master 4 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone