onnxruntime
[js/webgpu] optimize MatmulNBits
#21747
Merged

[js/webgpu] optimize MatmulNBits #21747

guschmue merged 11 commits into microsoft:main from qjia7:opt_matmulnbits
qjia7
qjia7
qjia7 opt matmulnbits
79a4ac15
qjia7 add outputNumber > 1
6bd5417f
qjia7 qjia7 force pushed from 8d5c18ea to bba62597 1 year ago
qjia7 clean code
d28ac0a2
qjia7 qjia7 force pushed from bba62597 to d28ac0a2 1 year ago
qjia7 qjia7 changed the title [WIP] Opt matmulnbits Opt matmulnbits 1 year ago
qjia7 qjia7 marked this pull request as ready for review 1 year ago
qjia7
fs-eire
fs-eire
fs-eire
azure-pipelines
azure-pipelines
azure-pipelines
fs-eire fs-eire changed the title Opt matmulnbits [js/webgpu] optimize MatmulNBits 1 year ago
qjia7 formant and add missing shaderCache hints
dab4542d
fs-eire
fs-eire
fs-eire
azure-pipelines
azure-pipelines
azure-pipelines
qjia7 use global_idx
e86150a2
qjia7 tune outputNumber
e0072102
qjia7 add limitations
2bf70efe
qjia7
guschmue
guschmue
azure-pipelines
guschmue
azure-pipelines
azure-pipelines
guschmue
guschmue
azure-pipelines
satyajandhyala satyajandhyala added ep:WebGPU
satyajandhyala
qjia7
qjia7 Merge branch 'main' into opt_matmulnbits
f68a2da9
qjia7 fix workgroupSize to reduce shader recompilation
cbacc4a4
qjia7 support zeroPoints as input
cb775ed0
qjia7 replace the old algorithm
2ab2c4ed
qjia7
satyajandhyala
satyajandhyala commented on 2024-08-22
satyajandhyala
satyajandhyala
satyajandhyala commented on 2024-08-22
qjia7
qjia7 commented on 2024-08-23
qjia7 qjia7 requested a review from satyajandhyala satyajandhyala 1 year ago
satyajandhyala
satyajandhyala approved these changes on 2024-08-23
guschmue
guschmue approved these changes on 2024-08-23
guschmue
guschmue
azure-pipelines
azure-pipelines
guschmue
azure-pipelines
guschmue
azure-pipelines
guschmue guschmue merged 87165b92 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone