vllm
0340f455 - Support expert parallel load balancing in Transformers backend (#26287)

Commit
107 days ago
Support expert parallel load balancing in Transformers backend (#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>