vllm
ee52d990 - [Quantization] support logical_widths for fp8 marlin (#30962)

Commit
1 day ago
[Quantization] support logical_widths for fp8 marlin (#30962) Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com> Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Author
Parents
Loading