onnxruntime
c19e4c02
- Implement QAttention And Enable tests (#16837)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Implement QAttention And Enable tests (#16837)
References
#18530 - Add TryConvertTensorToBroadcastScalarfor QAttention and MatMulIntToFloat
Author
zhangxiang1993
Committer
jeffbloo
Parents
e9d330e4
Loading