vllm
cede942b
- [Benchmark] Add support for multiple batch size benchmark through CLI in `benchmark_moe.py` (#20516)
Commit
210 days ago
[Benchmark] Add support for multiple batch size benchmark through CLI in `benchmark_moe.py` (#20516)

Signed-off-by: Brayden Zhong <b8zhong@uwaterloo.ca>
References
#20516 - [Benchmark] Add support for multiple batch size benchmark through CLI in `benchmark_moe.py` + Add Triton Fused MoE kernel config for FP8 E=16 on B200
Author
b8zhong
Parents
fe1e9248