vllm
96ad65b7 - [Transform] [Quantization] Add QuTLASS support to vLLM (#24440)

Commit
114 days ago
[Transform] [Quantization] Add QuTLASS support to vLLM (#24440) Signed-off-by: LopezCastroRoberto <roberto.lopez.castro@udc.es> Signed-off-by: Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Signed-off-by: Andrei Panferov <andrei@panferov.org> Co-authored-by: Andrei Panferov <andrei@panferov.org> Co-authored-by: Michael Goin <mgoin64@gmail.com>
Parents
Loading