vllm
[CLI env var] Add VLLM_FLASH_ATTN_MAX_NUM_SPLITS_FOR_CUDA_GRAPH in env variables
#25274
Merged

[CLI env var] Add VLLM_FLASH_ATTN_MAX_NUM_SPLITS_FOR_CUDA_GRAPH in env variables #25274

simon-mo merged 11 commits into vllm-project:main from Daisy-Ma-coder:main
Daisy-Ma-coder
Daisy-Ma-coder Daisy-Ma-coder requested a review from WoosukKwon WoosukKwon 133 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from robertgshaw2-redhat robertgshaw2-redhat 133 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from njhill njhill 133 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from ywang96 ywang96 133 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from comaniac comaniac 133 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from alexm-redhat alexm-redhat 133 days ago
github-actions
mergify mergify added v1
gemini-code-assist
gemini-code-assist commented on 2025-09-19
MatthewBonanni
Daisy-Ma-coder
LucasWilkinson
LucasWilkinson approved these changes on 2025-09-20
add VLLM_FLASH_ATTN_MAX_NUM_SPLITS_FOR_CUDA_GRAPH in env variables so…
08fbd5d0
update tests
afb62f4b
resolve pre-commit test failure due to E501: line too long
bca6d5df
DarkLight1337 [Frontend] Pass API server count to each process (#23717)
6e64b128
resolve pre-commit test failure due to E501: line too long
982937a4
get rid of _DEFAULT_MAX_NUM_SPLITS_FOR_CUDA_GRAPH and extend to non-M…
9c6c81db
fix test failure on imports
92709f78
update test converage to FA2 and FA3
7a418c72
Daisy-Ma-coder Daisy-Ma-coder force pushed 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from NickLucche NickLucche 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from tdoublep tdoublep 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from sighingnow sighingnow 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from bigPYJ1151 bigPYJ1151 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from hmellor hmellor 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from ApostaC ApostaC 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from jeejeelee jeejeelee 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from heheda12345 heheda12345 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from mgoin mgoin 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from tlrmchlsmth tlrmchlsmth 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from yewentao256 yewentao256 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from DarkLight1337 DarkLight1337 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from simon-mo simon-mo 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from aarnphm aarnphm 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from russellb russellb 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from benchislett benchislett 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from youkaichao youkaichao 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from houseroad houseroad 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from ProExpertProg ProExpertProg 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from chaunceyjiang chaunceyjiang 132 days ago
Daisy-Ma-coder Daisy-Ma-coder requested a review from zhuohan123 zhuohan123 132 days ago
mergify mergify added documentation
mergify mergify added ci/build
mergify mergify added frontend
mergify mergify added multi-modality
mergify mergify added performance
mergify mergify added qwen
mergify mergify added gpt-oss
mergify mergify added structured-output
mergify mergify added tpu
mergify
mergify mergify added needs-rebase
mergify mergify added kv-connector
Daisy-Ma-coder Daisy-Ma-coder force pushed to 7a418c72 132 days ago
mergify mergify removed tpu
mergify mergify removed needs-rebase
simon-mo simon-mo added ready
simon-mo
simon-mo requested changes on 2025-09-20
Revert "[Frontend] Pass API server count to each process (#23717)"
7a4b5282
Daisy-Ma-coder Daisy-Ma-coder force pushed to 7a4b5282 132 days ago
fix full cuda graph smoke test failure, int to str
668067be
fix full cuda graph smoke test failure, int to str
94410488
Daisy-Ma-coder Daisy-Ma-coder requested a review from simon-mo simon-mo 132 days ago
simon-mo
simon-mo approved these changes on 2025-09-22
simon-mo simon-mo merged cfbee3d0 into main 130 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone