DeepSpeed
Fix llama meta tensor loading in AutoTP and kernel injected inference
#3608
Merged

Loading