vllm
1f1b1bc0 - [V1][Quantization] Add CUDA graph compatible v1 GGUF support (#18646)

Commit
198 days ago
[V1][Quantization] Add CUDA graph compatible v1 GGUF support (#18646) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by: Isotr0py <2037008807@qq.com>
Author
Parents
Loading