onnxruntime
[js/webgpu] Optimize matmulnbits
#22360
Merged

[js/webgpu] Optimize matmulnbits #22360

fs-eire merged 6 commits into microsoft:main from qjia7:opt-matmulnbits
qjia7
qjia7 [js/webgpu] Optimize matmulnbits
93bd8391
qjia7 rename
429961ee
qjia7 Don't use workgroup memory for B
6f9845db
qjia7 tune workgroup size
ed571b63
qjia7 clean code
89201c35
qjia7 qjia7 changed the title [WIP][js/webgpu] Optimize matmulnbits [js/webgpu] Optimize matmulnbits 1 year ago
qjia7 qjia7 marked this pull request as ready for review 1 year ago
qjia7
qjia7 Limit to gen-12lp
033a72fd
guschmue guschmue added ep:WebGPU
fs-eire
fs-eire
fs-eire
azure-pipelines
azure-pipelines
azure-pipelines
guschmue
guschmue approved these changes on 2024-10-14
fs-eire fs-eire merged 8159723b into main 1 year ago
qjia7 qjia7 deleted the opt-matmulnbits branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone