vllm
[Bench] Add NVFP4 GEMM benchmark script #20578
Merged
mgoin merged 4 commits into vllm-project:main from neuralmagic:nvfp4-gemm-bench
Add NVFP4 GEMM benchmark script (789b94e5)
Fix (dbf87171)
mgoin added the quantization label
mgoin added the perf-benchmarks label
gemini-code-assist commented on 2025-07-07
mgoin marked this pull request as ready for review 164 days ago
mergify added the performance label
gemini-code-assist commented on 2025-07-07
Add CC guard (133ba5bc)
Fix global scale (463a4447)
mgoin added the ready label
mgoin requested a review from tlrmchlsmth 163 days ago
mgoin requested a review from robertgshaw2-redhat 163 days ago
mgoin merged 0bbac1c1 into main 162 days ago
mgoin deleted the nvfp4-gemm-bench branch 162 days ago
Reviewers: gemini-code-assist, tlrmchlsmth, robertgshaw2-redhat
Assignees: No one assigned
Labels: performance, quantization, perf-benchmarks, ready
Milestone: No milestone