vllm
[CLI env var] Add VLLM_FLASH_ATTN_MAX_NUM_SPLITS_FOR_CUDA_GRAPH in env variables
#25274
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
11
Changes
View On
GitHub
[CLI env var] Add VLLM_FLASH_ATTN_MAX_NUM_SPLITS_FOR_CUDA_GRAPH in env variables
#25274
simon-mo
merged 11 commits into
vllm-project:main
from
Daisy-Ma-coder:main
Daisy-Ma-coder
requested a review
from
WoosukKwon
133 days ago
Daisy-Ma-coder
requested a review
from
robertgshaw2-redhat
133 days ago
Daisy-Ma-coder
requested a review
from
njhill
133 days ago
Daisy-Ma-coder
requested a review
from
ywang96
133 days ago
Daisy-Ma-coder
requested a review
from
comaniac
133 days ago
Daisy-Ma-coder
requested a review
from
alexm-redhat
133 days ago
mergify
added
v1
gemini-code-assist
commented on 2025-09-19
LucasWilkinson
approved these changes on 2025-09-20
add VLLM_FLASH_ATTN_MAX_NUM_SPLITS_FOR_CUDA_GRAPH in env variables so…
08fbd5d0
update tests
afb62f4b
resolve pre-commit test failure due to E501: line too long
bca6d5df
[Frontend] Pass API server count to each process (#23717)
6e64b128
resolve pre-commit test failure due to E501: line too long
982937a4
get rid of _DEFAULT_MAX_NUM_SPLITS_FOR_CUDA_GRAPH and extend to non-M…
9c6c81db
fix test failure on imports
92709f78
update test converage to FA2 and FA3
7a418c72
Daisy-Ma-coder
force pushed
132 days ago
Daisy-Ma-coder
requested a review
from
NickLucche
132 days ago
Daisy-Ma-coder
requested a review
from
tdoublep
132 days ago
Daisy-Ma-coder
requested a review
from
sighingnow
132 days ago
Daisy-Ma-coder
requested a review
from
bigPYJ1151
132 days ago
Daisy-Ma-coder
requested a review
from
hmellor
132 days ago
Daisy-Ma-coder
requested a review
from
ApostaC
132 days ago
Daisy-Ma-coder
requested a review
from
jeejeelee
132 days ago
Daisy-Ma-coder
requested a review
from
heheda12345
132 days ago
Daisy-Ma-coder
requested a review
from
mgoin
132 days ago
Daisy-Ma-coder
requested a review
from
tlrmchlsmth
132 days ago
Daisy-Ma-coder
requested a review
from
yewentao256
132 days ago
Daisy-Ma-coder
requested a review
from
DarkLight1337
132 days ago
Daisy-Ma-coder
requested a review
from
simon-mo
132 days ago
Daisy-Ma-coder
requested a review
from
aarnphm
132 days ago
Daisy-Ma-coder
requested a review
from
russellb
132 days ago
Daisy-Ma-coder
requested a review
from
benchislett
132 days ago
Daisy-Ma-coder
requested a review
from
youkaichao
132 days ago
Daisy-Ma-coder
requested a review
from
houseroad
132 days ago
Daisy-Ma-coder
requested a review
from
ProExpertProg
132 days ago
Daisy-Ma-coder
requested a review
from
chaunceyjiang
132 days ago
Daisy-Ma-coder
requested a review
from
zhuohan123
132 days ago
mergify
added
documentation
mergify
added
ci/build
mergify
added
frontend
mergify
added
multi-modality
mergify
added
performance
mergify
added
qwen
mergify
added
gpt-oss
mergify
added
structured-output
mergify
added
tpu
mergify
added
needs-rebase
mergify
added
kv-connector
Daisy-Ma-coder
force pushed
to
7a418c72
132 days ago
mergify
removed
tpu
mergify
removed
needs-rebase
simon-mo
added
ready
simon-mo
requested changes on 2025-09-20
Revert "[Frontend] Pass API server count to each process (#23717)"
7a4b5282
Daisy-Ma-coder
force pushed
to
7a4b5282
132 days ago
fix full cuda graph smoke test failure, int to str
668067be
fix full cuda graph smoke test failure, int to str
94410488
Daisy-Ma-coder
requested a review
from
simon-mo
132 days ago
simon-mo
approved these changes on 2025-09-22
simon-mo
merged
cfbee3d0
into main
130 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
simon-mo
LucasWilkinson
gemini-code-assist
WoosukKwon
robertgshaw2-redhat
njhill
ywang96
comaniac
alexm-redhat
NickLucche
tdoublep
sighingnow
bigPYJ1151
hmellor
ApostaC
jeejeelee
heheda12345
mgoin
tlrmchlsmth
yewentao256
DarkLight1337
aarnphm
russellb
benchislett
youkaichao
houseroad
ProExpertProg
chaunceyjiang
zhuohan123
Assignees
No one assigned
Labels
documentation
performance
structured-output
frontend
ready
ci/build
v1
multi-modality
qwen
gpt-oss
kv-connector
Milestone
No milestone
Login to write a write a comment.
Login via GitHub