DeepSpeed
a5adb90d - Enabling CUDA-graph for the bert-type models (#1952)

Comment changes are shownComment changes are hidden
Commit
3 years ago
Enabling CUDA-graph for the bert-type models (#1952) Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Parents
  • deepspeed
    • File
      __init__.py
    • inference
      • File
        engine.py