DeepSpeed
a3926bbb - infV2 fix for OPT size variants (#4694)

Comment changes are shownComment changes are hidden
Commit
1 year ago
infV2 fix for OPT size variants (#4694) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Author
Parents
  • .github/workflows
    • File
      nv-a6000.yml
  • deepspeed/inference/v2
    • File
      engine_factory.py
    • model_implementations
      • File
        layer_container_base.py
      • opt
        • File
          container.py
        • File
          policy.py