vllm
75d29cf4
- [Perf] Cuda Kernel for Int8 Per Token Group Quant (#21476)
Commit
212 days ago
[Perf] Cuda Kernel for Int8 Per Token Group Quant (#21476)

Signed-off-by: yewentao256 <zhyanwentao@126.com>
References
#21476 - [Perf] Cuda Kernel for Int8 Per Token Group Quant
Author
yewentao256
Parents
41d3082c