onnxruntime
b77dbd43
- Optimize layout for SubgroupMatrixLoad on Intel (#25384)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
154 days ago
Optimize layout for SubgroupMatrixLoad on Intel (#25384) This introduces a new LayoutProgram to pre-process the input matrix A, converting it to a layout that is more efficient for the SubgroupMatrixLoad operation on Intel GPUs.
References
#25384 - Optimize layout for SubgroupMatrixLoad on Intel
Author
jchen10
Parents
e57dc2a2
Loading