onnxruntime
[webgpu] Optimize flash decoding by merging QKT and SplitVx shader
#25929
Open
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
[webgpu] Optimize flash decoding by merging QKT and SplitVx shader
#25929
xiaofeihan1
wants to merge 1 commit into
microsoft:main
from
xiaofeihan1:xiaofeihan/optimize_flash_decoding
implement
0a3da97d
Login to write a write a comment.
Login via GitHub
Reviewers
No reviews
Assignees
No one assigned
Labels
None yet
Milestone
No milestone
Login to write a write a comment.
Login via GitHub