vllm
38c498b8
- [Performance] Cublas Bf16 Gate with Fp32 Output (#35121)
Commit
2 days ago
[Performance] Cublas Bf16 Gate with Fp32 Output (#35121)

Signed-off-by: Roi Koren <roik@nvidia.com>
References
#35121 - [Performance] Cublas Bf16 Gate with Fp32 Output
Author
roikoren755
Parents
56a63717