DeepSpeed
367d6f9c - Support InternLM (#4137)

Commit
2 years ago
Support InternLM (#4137) * correct inference with some debug codes. * remove prints * update transformer import set_qkv and format * support some lora abstract method * fix attn_ob * some debug * leave orig layer set by user * remove debugs * move attn ob to mlp module * move import transformer * init orig class only once * remove copyright --------- Co-authored-by: Lev Kurilenko <113481193+lekurile@users.noreply.github.com>
Author
Parents
Loading