DeepSpeed
f8a65cb5 - Fix inference Api (#1095)

Commit
4 years ago
Fix inference Api (#1095)

* Fix Inference and Quantization tutorial links
* fix inference api
* use correct attention scaling

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>