DeepSpeed
8d9c9e74
- Scale query and key before attn_score gemm for more accurate attention
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Scale query and key before attn_score gemm for more accurate attention
Author
Reza Yazdani
Parents
d1c3c0df
Loading