vllm
[Performance] Cublas Bf16 Gate with Fp32 Output
#35121
Merged
Commits (5)
Initial router custom op commit (roikoren755, committed 6 days ago)
Fix missing import (roikoren755, committed 6 days ago)
CR (roikoren755, committed 6 days ago)
Use not-deprecated GEMM algo (roikoren755, committed 6 days ago)
CR and fixing fallback (roikoren755, committed 6 days ago)