DeepSpeed
f5d987d1 - add some changes to inject/run the 70b llama model

Commit

2 years ago

add some changes to inject/run the 70b llama model

References

#4313 - Add the policy to run llama model from the official repo

#4351 - DS-Inference Quantization refresh: Fix several issues and add more features

Author

Reza Yazdani

Parents

Loading