vllm
68c4421b - [AMD][Quantization] Add TritonScaledMMLinearKernel since int8 is broken for AMD (#12282)

Commit
1 year ago
[AMD][Quantization] Add TritonScaledMMLinearKernel since int8 is broken for AMD (#12282) Signed-off-by: Randall Smith <Randall.Smith@amd.com>
Author
Parents
Loading