onnxruntime
fd8ee489 - [JS/WebGPU] GroupQueryAttention rewrite (#20946)

Commit
1 year ago
[JS/WebGPU] GroupQueryAttention rewrite (#20946)

### Description
Implement JSEP GroupQueryAttention.

### Motivation and Context
Required to enable certain LLM models to run using WebGPU.