DeepSpeed
74b6f763
- Merge branch 'master' into staging-inference-v2-5
Commit
1 year ago
Merge branch 'master' into staging-inference-v2-5
References
#4604 - DeepSpeed-FastGen
Author
cmikeh2
Parents
49cea1ab
f0604078
Files
24
.github/workflows/
    cpu-inference.yml
README.md
csrc/deepspeed4science/evoformer_attn/
    attention_cu.cu
deepspeed/
    checkpoint/
        constants.py
        ds_to_universal.py
    module_inject/
        auto_tp.py
        auto_tp_model_utils.py
        fusedqkv_utils.py
        layers.py
        replace_module.py
        tp_shard.py
    runtime/
        config.py
        constants.py
        engine.py
        pipe/
            engine.py
            module.py
        zero/
            stage3.py
            stage_1_and_2.py
docs/
    assets/files/
        zeroquant_series.pdf
    code-docs/source/
        model-checkpointing.rst
    index.md
op_builder/
    builder.py
    evoformer_attn.py
tests/unit/inference/
    test_inference.py