vllm
[V1][Quantization] Add CUDA graph compatible v1 GGUF support
#18646
Merged

Isotr0py merged 6 commits into vllm-project:main from Isotr0py:v1-gguf
Isotr0py init v1 GGUF support
5ec7e1fc
Isotr0py clean up
92e61fe3
Isotr0py fix gguf moe
4e31dd23
Isotr0py fix kernel test
7419b109
Isotr0py marked this pull request as ready for review 201 days ago
Isotr0py requested a review from tlrmchlsmth 201 days ago
Isotr0py requested a review from WoosukKwon 201 days ago
Isotr0py requested a review from mgoin 201 days ago
Isotr0py requested a review from robertgshaw2-redhat 201 days ago
mgoin approved these changes on 2025-05-25
Isotr0py enabled auto-merge (squash) 200 days ago
github-actions added the ready label
disabled auto-merge 200 days ago
Manually disabled by user
Isotr0py Merge branch 'vllm-project:main' into v1-gguf
35ef6d96
Isotr0py enabled auto-merge (squash) 200 days ago
Isotr0py disable stablelm gguf test
4e26c5dc
Isotr0py requested a review from DarkLight1337 199 days ago
Isotr0py requested a review from ywang96 199 days ago
Isotr0py merged 1f1b1bc0 into main 199 days ago
Isotr0py deleted the v1-gguf branch 199 days ago

Assignees
No one assigned