vllm
75d29cf4 - [Perf] Cuda Kernel for Int8 Per Token Group Quant (#21476)

Commit
212 days ago
[Perf] Cuda Kernel for Int8 Per Token Group Quant (#21476)

Signed-off-by: yewentao256 <zhyanwentao@126.com>