vllm
a532c838
- use 'max_active_experts' for moe lora input size (#33197)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
57 days ago
use 'max_active_experts' for moe lora input size (#33197) Signed-off-by: gnovack <gnovack@amazon.com>
References
#33197 - use 'max_active_experts' for moe lora input size
Author
gnovack
Parents
1e5ad9b7
Loading