DeepSpeed
f5d987d1
- add some changes to inject/run the 70b llama model
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
add some changes to inject/run the 70b llama model
References
#4351 - DS-Inference Quantization refresh: Fix several issues and add more features
#4313 - Add the policy to run llama model from the official repo
Author
Reza Yazdani
Parents
7b2142c7
Loading