DeepSpeed
f8a65cb5
- Fix inference Api (#1095)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
4 years ago
Fix inference Api (#1095) * Fix Inference and Quantization tutorial links * fix inference api * use correct attention scaling Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
References
#1095 - Fix inference Api
Author
RezaYazdaniAminabadi
Parents
b49c99b0
Loading