vllm
[Bugfix] Allow `CUDA_VISIBLE_DEVICES=''` in `Platform.device_id_to_physical_device_id`
#18979
Merged

eicherseiji
github-actions
eicherseiji changed the title from "Fix FlashMLA detection in ray environment" to "Avoid a crash in is_flashmla_supported() by handling Platform.get_device_capability()'s optional return value" 266 days ago
eicherseiji requested a review from WoosukKwon 265 days ago
eicherseiji requested a review from robertgshaw2-redhat 265 days ago
eicherseiji requested a review from njhill 265 days ago
eicherseiji requested a review from ywang96 265 days ago
eicherseiji requested a review from comaniac 265 days ago
eicherseiji requested a review from alexm-redhat 265 days ago
mergify added v1
mergify
eicherseiji changed the title from "Avoid a crash in is_flashmla_supported() by handling Platform.get_device_capability()'s optional return value" to "Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" 265 days ago
mergify added needs-rebase
eicherseiji force pushed 265 days ago
mergify removed needs-rebase
eicherseiji changed the title from "Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" to "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" 265 days ago
eicherseiji changed the title from "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" to "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving FlashMLA block_size fixup to GPU worker" 265 days ago
eicherseiji
kouroshHakha
ProExpertProg
njhill requested a review from LucasWilkinson 265 days ago
njhill
njhill commented on 2025-06-04
LucasWilkinson
eicherseiji force pushed 264 days ago
eicherseiji
eicherseiji force pushed 264 days ago
eicherseiji
eicherseiji requested a review from hmellor 263 days ago
eicherseiji requested a review from mgoin 263 days ago
eicherseiji requested a review from DarkLight1337 263 days ago
eicherseiji requested a review from tlrmchlsmth 263 days ago
eicherseiji requested a review from simon-mo 263 days ago
eicherseiji requested a review from aarnphm 263 days ago
eicherseiji requested a review from zhuohan123 263 days ago
eicherseiji requested a review from youkaichao 263 days ago
mergify added documentation
mergify added ci/build
mergify added frontend
mergify added multi-modality
mergify added tpu
mergify added tool-calling
eicherseiji force pushed 263 days ago
mergify removed tpu
eicherseiji
eicherseiji force pushed 263 days ago
eicherseiji force pushed 261 days ago
eicherseiji
eicherseiji force pushed 260 days ago
mergify
mergify added needs-rebase
eicherseiji force pushed to eea7dcbe 260 days ago
mergify removed needs-rebase
eicherseiji force pushed to 9f24ef5b 260 days ago
eicherseiji
ruisearch42
ruisearch42 commented on 2025-06-11
eicherseiji changed the title from "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving FlashMLA block_size fixup to GPU worker" to "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA, `dtype: auto`) to worker" 258 days ago
eicherseiji changed the title from "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA, `dtype: auto`) to worker" to "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA capability, `dtype: 'auto'`) to worker" 258 days ago
kouroshHakha
kouroshHakha commented on 2025-06-12
gemini-code-assist
gemini-code-assist
eicherseiji
eicherseiji force pushed 257 days ago
kouroshHakha added ready
kouroshHakha
kouroshHakha commented on 2025-06-16
kouroshHakha
eicherseiji
eicherseiji force pushed 253 days ago
eicherseiji force pushed 253 days ago
kouroshHakha
kouroshHakha approved these changes on 2025-06-16
eicherseiji force pushed 253 days ago
eicherseiji
eicherseiji force pushed 252 days ago
eicherseiji force pushed 252 days ago
kouroshHakha
kouroshHakha commented on 2025-06-18
ruisearch42
ruisearch42 commented on 2025-06-18
eicherseiji
eicherseiji force pushed 247 days ago
eicherseiji force pushed 247 days ago
eicherseiji force pushed 247 days ago
eicherseiji
kouroshHakha
kouroshHakha approved these changes on 2025-06-23
ruisearch42
ruisearch42 commented on 2025-06-24
eicherseiji changed the title from "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA capability, `dtype: 'auto'`) to worker" to "[Bugfix] Allow `CUDA_VISIBLE_DEVICES=''` in `Platform.device_id_to_physical_device_id`" 246 days ago
eicherseiji
eicherseiji Fix FlashMLA detection in ray environment
22fdba14
eicherseiji Move FlashMLA capability check to GPU worker
979129b7
eicherseiji Unit test
5055e5bd
eicherseiji Create helper function _resolve_hardware_dependent_config
9e943ea0
eicherseiji Add V0 support under env flag
b812cd17
eicherseiji Parameterize _get_and_verify_dtype by defer_to_worker
4e4e092b
eicherseiji Change parameter name to 'defer_auto_to_worker'
baadec88
eicherseiji Add _resolve_hardware_dependent_config for V0 Worker
8df72535
eicherseiji Resolve lora dtype
b2fea3c6
eicherseiji Move V1 supported type check into auto resolution's if block
6500644d
eicherseiji Move config resolution to WorkerBase
8c308410
eicherseiji Fix kwargs list
7ff5daea
eicherseiji Inline init_config with init_worker
ceccc0d1
eicherseiji Testing without V0-specific flag
c1912995
eicherseiji Remove whitespace changes
190f9f75
eicherseiji Avoid modifying _get_and_verify_dtype signature for the sake of testing
ab584319
eicherseiji Only check V1 supported dtypes in V1
5f87e531
eicherseiji Move config fixup logic to VllmConfig.resolve_config_with_hardware, f…
e77592cc
eicherseiji Support inplace model weights loading
8e0f77b6
eicherseiji Try resolving dtype on worker in ray
d5f88bb4
eicherseiji Treat empty device control env var as unset
afe0e010
eicherseiji Allow empty string CUDA_VISIBLE_DEVICES
66bd1fb2
eicherseiji force pushed to 66bd1fb2 245 days ago
eicherseiji
ruisearch42
ruisearch42 approved these changes on 2025-06-25
kouroshHakha
kouroshHakha approved these changes on 2025-06-25
aarnphm
aarnphm approved these changes on 2025-06-25
vllm-bot merged 65397e40 into main 244 days ago
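
The final title and the last two commits ("Treat empty device control env var as unset", "Allow empty string CUDA_VISIBLE_DEVICES") suggest the merged fix makes the logical-to-physical device ID mapping tolerate an empty `CUDA_VISIBLE_DEVICES`. The sketch below illustrates that idea only. It is not the code from this PR: `to_physical_device_id` and its `env_var` parameter are hypothetical names, and the real logic lives in `Platform.device_id_to_physical_device_id`. One plausible reason for the original crash is that `"".split(",")` yields `[""]` in Python, and `int("")` raises `ValueError`.

```python
import os


def to_physical_device_id(device_id: int,
                          env_var: str = "CUDA_VISIBLE_DEVICES") -> int:
    """Map a logical device index to a physical one (hypothetical helper).

    Assumption: an empty value is treated the same as an unset variable,
    as the commit message "Treat empty device control env var as unset"
    describes.
    """
    visible = os.environ.get(env_var)
    # Unset or empty string: there is no remapping to apply, so the
    # logical index is returned unchanged instead of parsing "".
    if not visible:
        return device_id
    # Otherwise the variable is a comma-separated list of physical IDs,
    # indexed by the logical device ID.
    physical_ids = visible.split(",")
    return int(physical_ids[device_id])
```

With `CUDA_VISIBLE_DEVICES="2,3"`, logical device 1 maps to physical device 3. With the variable empty or unset, the index passes through unchanged. This sketch assumes integer device IDs and does not handle UUID-style entries.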