vllm
260d119e
- [Kernel] Refactor CUTLASS kernels to always take scales that reside on the GPU (#5137)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
[Kernel] Refactor CUTLASS kernels to always take scales that reside on the GPU (#5137)
References
#5137 - [Kernel] Refactor CUTLASS kernels to always take scales that reside on the GPU
Author
tlrmchlsmth
Parents
a360ff80
Loading