vllm
[V1][Quantization] Add CUDA graph compatible v1 GGUF support
#18646
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
6
Changes
View On
GitHub
[V1][Quantization] Add CUDA graph compatible v1 GGUF support
#18646
Isotr0py
merged 6 commits into
vllm-project:main
from
Isotr0py:v1-gguf
init v1 GGUF support
5ec7e1fc
clean up
92e61fe3
fix gguf moe
4e31dd23
fix kernel test
7419b109
Isotr0py
marked this pull request as ready for review
201 days ago
Isotr0py
requested a review
from
tlrmchlsmth
201 days ago
Isotr0py
requested a review
from
WoosukKwon
201 days ago
Isotr0py
requested a review
from
mgoin
201 days ago
Isotr0py
requested a review
from
robertgshaw2-redhat
201 days ago
mgoin
approved these changes on 2025-05-25
Isotr0py
enabled auto-merge (squash)
200 days ago
github-actions
added
ready
disabled auto-merge
200 days ago
Manually disabled by user
Merge branch 'vllm-project:main' into v1-gguf
35ef6d96
Isotr0py
enabled auto-merge (squash)
200 days ago
disable stablelm gguf test
4e26c5dc
Isotr0py
requested a review
from
DarkLight1337
199 days ago
Isotr0py
requested a review
from
ywang96
199 days ago
Isotr0py
merged
1f1b1bc0
into main
199 days ago
Isotr0py
deleted the v1-gguf branch
199 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
mgoin
tlrmchlsmth
WoosukKwon
robertgshaw2-redhat
DarkLight1337
ywang96
Assignees
No one assigned
Labels
ready
Milestone
No milestone
Login to write a write a comment.
Login via GitHub