DeepSpeed
a3926bbb
- infV2 fix for OPT size variants (#4694)
Commit
1 year ago
infV2 fix for OPT size variants (#4694)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
References
#4694 - infV2 fix for OPT size variants
Author
mrwyattii
Parents
ce0ebdad
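Files (5)
.github/workflows/nv-a6000.yml
deepspeed/inference/v2/engine_factory.py
deepspeed/inference/v2/model_implementations/layer_container_base.py
deepspeed/inference/v2/model_implementations/opt/container.py
deepspeed/inference/v2/model_implementations/opt/policy.py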