onnxruntime
[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance
#23908
Merged

[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance #23908

daijh
daijh
daijh
daijh
guschmue guschmue added ep:WebGPU
guschmue
azure-pipelines
guschmue
guschmue
guschmue
azure-pipelines
azure-pipelines
azure-pipelines
sushraja-msft
sushraja-msft commented on 2025-03-06
sushraja-msft
sushraja-msft commented on 2025-03-06
sushraja-msft
sushraja-msft commented on 2025-03-06
daijh
daijh
guschmue
sushraja-msft
daijh
daijh
daijh daijh force pushed from 8a250dbf to 74da2901 1 year ago
guschmue
guschmue
guschmue
azure-pipelines
guschmue
azure-pipelines
azure-pipelines
azure-pipelines
daijh
sushanthr
sushanthr commented on 2025-03-25
sushanthr
sushanthr commented on 2025-03-25
sushanthr
sushanthr commented on 2025-03-25
sushanthr
sushanthr commented on 2025-03-25
sushanthr
sushanthr approved these changes on 2025-03-25
daijh
sushanthr
sushanthr commented on 2025-03-26
sushanthr
sushanthr approved these changes on 2025-03-26
daijh
guschmue
daijh [webgpu] Optimize MatMulNBits for f16 Block32 prefill performance
1be49b21
daijh Resolve comments
4ead0043
daijh Fix variable naming
14bbe9df
daijh Add comment on f32 accumulator
0f55827f
daijh Improve comment
695d9d05
daijh More comment and avoid magic number
fd751cbd
daijh Improve variable naming
58d76f62
daijh Add tile_m and tile_n into constructor
ae482a2a
daijh Rename to MatMulNBitsWideTileProgram
287be7e3
daijh Improve comment to reflect new naming
ca1710a9
daijh daijh force pushed from 4d3801f3 to ca1710a9 346 days ago
daijh
guschmue
daijh Fix lint
17c0b1f3
daijh
daijh
guschmue
guschmue dismissed these changes on 2025-04-02
daijh Prefer onnxruntime::narrow
e7f8bb43
guschmue
azure-pipelines
guschmue guschmue dismissed their stale review 343 days ago
done
guschmue
guschmue approved these changes on 2025-04-04
guschmue guschmue merged 3dfc2ae3 into main 343 days ago
daijh daijh deleted the matmul-f16-block32-prefill branch 343 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone