vllm
71ce4404
- Support S3 Sharded loading with RunAI Model Streamer (#16317)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
234 days ago
Support S3 Sharded loading with RunAI Model Streamer (#16317) Signed-off-by: Omer Dayan (SW-GPU) <omer@run.ai> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
References
#16317 - Support S3 Sharded loading with RunAI Model Streamer
Author
omer-dayan
Parents
188b7f9b
Loading