onnxruntime
[webgpu] Use subgroup for matmulnbits
#23224
Merged

qjia7
qjia7 [webgpu] Add kernel type to profile info
3b32acc0
qjia7 relax some limitations
8e0b27f9
qjia7 [webgpu] Use override shape in shader key
c0dd1db2
qjia7 subgroup support only for subgroup size = 32
a66dd014
qjia7 support subgroup with size = 32
d3a196b6
qjia7 subgroup size = 32, workgroup size = 64
99fe6977
qjia7 put a/scale data into workgroup memory
0e850483
qjia7 support subgroup size = 8/16/32/64
6feb70f6
qjia7 workaround the issue that subgroupShuffle is very slow
55fc6bbe
qjia7 add sg_size check
128cd7de
qjia7 Revert "add sg_size check"
0775e4eb
qjia7 apply subgroup for tile_m > 1
454fd0dc
qjia7 check subgroup feature
16fc0757
qjia7 only apply subgroup for intel GPUs
622bdd08
qjia7 Revert "[webgpu] Use override shape in shader key"
729df58e
qjia7 Revert "[webgpu] Add kernel type to profile info"
dff1961e
qjia7 add more limitations to use subgroup
8328e149
qjia7 changed the title from "[Not for Review] [webgpu] Test subgroup for matmulnbits" to "[webgpu] Use subgroup for matmulnbits" 363 days ago
qjia7 marked this pull request as ready for review 363 days ago
guschmue added the ep:WebGPU label
guschmue approved these changes on 2025-01-12
guschmue merged 80d8931f into main 352 days ago
