Inference Refactor (replace_with_policy, model_implementations) #2554
Public inference v2 (#387)
102a1bdb
remove notes.
3f511079
Container based OPT is working
8ca77499
Megatron container working w/ moe=1
75003bed
Add ds_megatron_gpt model implementation + make explicit megatron is …
9a36258d
Clean up
09f37abb
Merge branch 'master' into inference-refactor-v1
74233437
TODO comments
9b24af91
Update megatron container to work with MoE==1 text-generation example
d190f68f
move _transpose() to base container
5651f9e3
Add megatron_v2 class variable to megatron policy
8186da32
Change _transpose to transpose_qkv_alignment
699336bb
Properly source megatron_v2 flag from policy in containers
eecae6fe
Add comprehension of megatron_v2 in gptneox, and rotation config
992ace22
remove TODO
28e6b617
Make containers inherit scale_attention from policy
105ba4b3
Merge branch 'master' into inference-refactor-v1
4479a07b
Update all policies for generic load checkpoint
4e668887
Add generic load checkpoint to replace_layer, still need to add to co…
79643351
Make megatron_v2 part of base container
52bba028
Update from model_config to config in replace_module.py
3feff1fd
Merge branch 'master' into inference-refactor-v1
55e929df
Get generic checkpoint loading working w/o containers
a8f440e3
Use self. for tensors in bloom container
7b65351a
Comprehend quantizer in containers + generic checkpoint works for GPT…
794092e3
Remove .scale comments from ds_mlp and ds_attention
fbd0b65a
Whitespace in replace_layer to match replace_module
bc9002ae
Apply weight quantization in BERT container
791935ca
Small clean-up
da475804
Move weight quantization into separate function in base container
381e61b3
First base container re-org
a0979fea
Fix a few bugs after the base re-org
5d602d62
Remove unnecessary code from BERT container
e71035d2
moved megatron and meta tensor support to separate containers, added …
1d5149d3
Add MegatronContainer to GPTNEOX container
22d7c8c1
Move feature containers into features folder
8136d71a
Update BLOOM container
7e03935c
Remove gpt2 model container transpose, since base comprehends guard f…
bb297022
Add __init__.py in features folder
ede2749c
Merge branch 'master' into inference-refactor-v1
2294cb8d
Merge branch 'master' into inference-refactor-v1
b8527530
Add MoE container (still needs testing) + various refactor
7884c9c6
Merge branch 'master' into inference-refactor-v1
b368af45
Merge branch 'master' into inference-refactor-v1
ae313969
Fix syntax/import for base moe container
6cc71070
Source config params instead of using magic numbers
a6a807be
Add DistilBERT to utils.py
aff6cc3a
Create MoE config function
fa56b96b
Use **kwargs for containers
5a2c192b
Remove init_inference_v2 function, use getattr in place of hasattr
32180da6
Merge branch 'master' into inference-refactor-v1
711231d5
Move replace_module components into replace_layer, part of consolidation
fc921397
Move policies + containers into same file
b1edf4fc
Update policy imports
3a6ff98c
Remove separate policy files since policies consolidated in container…
dac16023
Update replace_layer w/ code from replace_module
fa437de3
Make replace_layer -> replace_module, update to match replace_module …
a1122d61
Update minor policy changes from master
58b1e374
container fixes for SD models
c98c757d
Clean up and polish
6a814bd8
print_configuration to use logger
cedd3d59
Remove debug prints from policy_to_ds_container
b8844799
lekurile
changed the title Inference v2 Inference Refactor (replace_with_policy, model_implementations) 2 years ago
Merge branch 'master' into inference-refactor-v1
24984334
lekurile
marked this pull request as ready for review 2 years ago
Merge branch 'master' into inference-refactor-v1
bfeba271
Update CLIP policy
ed09e065
Merge branch 'master' into inference-refactor-v1
3ca419d6
refactor policy to container mapping function
3a2fa502
Use @abstractmethod on policy functions
c6a4c50b
Merge branch 'master' into inference-refactor-v1
cdca7be5
mrwyattii
approved these changes
on 2023-01-19
lekurile
approved these changes
on 2023-01-19
jeffra
approved these changes
on 2023-01-19
jeffra
merged
867da307
into master 2 years ago
jeffra
deleted the inference-refactor-v1 branch 2 years ago
Login to write a write a comment.
Login via GitHub