onnxruntime
2cf0ae7d
- [ROCm] Add AttentionMode to make attention logic streamline (#15978)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
[ROCm] Add AttentionMode to make attention logic streamline (#15978) Refactor for future kv cache change.
References
#15978 - [ROCm] Add AttentionMode to make attention logic streamline
Author
cloudhan
Parents
b28e927c
Loading