vllm
1f1b1bc0
- [V1][Quantization] Add CUDA graph compatible v1 GGUF support (#18646)
Commit
198 days ago
[V1][Quantization] Add CUDA graph compatible v1 GGUF support (#18646)

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: Isotr0py <2037008807@qq.com>
References
#18646 - [V1][Quantization] Add CUDA graph compatible v1 GGUF support
Author
Isotr0py
Parents
1f88dbd2