vllm
Benchmark script for fp8 vs bf16 gemm
#17126
Merged

Benchmark script for fp8 vs bf16 gemm #17126

mgoin
mgoin Add benchmark for fp8 vs bf16 gemm
2ec2c26f
github-actions
mgoin mgoin changed the title Add benchmark for fp8 vs bf16 gemm Benchmark script for fp8 vs bf16 gemm 233 days ago
mgoin Merge branch 'main' into bench_fp8_gemm
45bc7fbd
mgoin Merge branch 'main' into bench_fp8_gemm
ed482251
mgoin Update with cudagraphs and remove uncommon schemes
80074717
mgoin Correct gbps
2fe731c3
mgoin Add more M
5ef160d1
alexm-redhat
alexm-redhat approved these changes on 2025-05-29
mgoin Merge branch 'main' into bench_fp8_gemm
58b8fa0d
mgoin Cleanup and correct metric
4deda826
mgoin mgoin added performance
mgoin mgoin added ready
mgoin mgoin merged f49239cb into main 197 days ago
mgoin mgoin deleted the bench_fp8_gemm branch 197 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone