onnxruntime
fd8ee489
- [JS/WebGPU] GroupQueryAttention rewrite (#20946)
Commit
1 year ago
[JS/WebGPU] GroupQueryAttention rewrite (#20946)

### Description
Implement JSEP GroupQueryAttention

### Motivation and Context
Required to enable certain LLM models to run using WebGPU.
References
#20946 - [JS/WebGPU] GroupQueryAttention rewrite
Author
satyajandhyala
Parents
33e2f6ad