vllm
cfa134d2 - [Bugfix/CI] Fixup benchmark_moe.py (#12562)

Commit
1 year ago
[Bugfix/CI] Fixup benchmark_moe.py (#12562) Fixes `is_marlin` not being passed into `get_default_config` Also allow `--tensor-parallel-size` in addition to `-tp` and `--tp-size` Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Author
Parents
Loading