vllm
[Performance] Cublas Bf16 Gate with Fp32 Output
#35121
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
5
Changes
View On
GitHub
[Performance] Cublas Bf16 Gate with Fp32 Output
#35121
vllm-bot
merged 5 commits into
vllm-project:main
from
roikoren755:feat/gate-linear-with-fallback
roikoren755
requested a review
from
tlrmchlsmth
9 days ago
roikoren755
requested a review
from
LucasWilkinson
9 days ago
roikoren755
requested a review
from
mgoin
9 days ago
roikoren755
requested a review
from
pavanimajety
9 days ago
mergify
added
ci/build
mergify
added
deepseek
gemini-code-assist
commented on 2026-02-23
robertgshaw2-redhat
changed the title
Gate linear with fallback
[Performance] Cublas Bf16 Gate with Fp32 Output
9 days ago
roikoren755
force pushed
from
8bf87bf5
to
9e4bcb28
8 days ago
roikoren755
force pushed
from
3b70bee2
to
50ecd354
8 days ago
roikoren755
force pushed
from
50ecd354
to
25e2bf52
7 days ago
mgoin
added
performance
mgoin
added
nvidia
mgoin
commented on 2026-02-25
Initial router custom op commit
9de54f41
Fix missing import
eab812dd
CR
f70d1619
Use not-deprecated GEMM algo
0e62a05c
CR and fixing fallback
5e18e1be
roikoren755
force pushed
from
25e2bf52
to
5e18e1be
6 days ago
mgoin
commented on 2026-02-26
mgoin
added
ready
vllm-bot
merged
38c498b8
into main
5 days ago
roikoren755
deleted the feat/gate-linear-with-fallback branch
2 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
mgoin
gemini-code-assist
tlrmchlsmth
LucasWilkinson
pavanimajety
Assignees
No one assigned
Labels
performance
ready
ci/build
deepseek
nvidia
Milestone
No milestone
Login to write a write a comment.
Login via GitHub