eicherseiji
changed the title from "Fix FlashMLA detection in ray environment" to "Avoid a crash in is_flashmla_supported() by handling Platform.get_device_capability()'s optional return value" 266 days ago
eicherseiji
changed the title from "Avoid a crash in is_flashmla_supported() by handling Platform.get_device_capability()'s optional return value" to "Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" 265 days ago
eicherseiji
changed the title from "Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" to "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" 265 days ago
eicherseiji
changed the title from "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving block_size fixup to GPU worker" to "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving FlashMLA block_size fixup to GPU worker" 265 days ago
eicherseiji
changed the title from "[Regression][Bugfix] Avoid a crash in is_flashmla_supported() by moving FlashMLA block_size fixup to GPU worker" to "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA, `dtype: auto`) to worker" 258 days ago
eicherseiji
changed the title from "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA, `dtype: auto`) to worker" to "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA capability, `dtype: 'auto'`) to worker" 258 days ago
eicherseiji
changed the title from "[Bugfix] Move hardware-dependent configuration resolution (FlashMLA capability, `dtype: 'auto'`) to worker" to "[Bugfix] Allow `CUDA_VISIBLE_DEVICES=''` in `Platform.device_id_to_physical_device_id`" 246 days ago
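For context on the final title: in Python, `"".split(",")` returns `[""]`, so a lookup that indexes into the split value of an empty `CUDA_VISIBLE_DEVICES` and calls `int()` on the result fails. The following is only a minimal standalone sketch of tolerating the empty value. The free function, its `env_var` parameter, and the choice to fall back to the logical ID are assumptions made for illustration, not the code in this PR.

```python
import os


def device_id_to_physical_device_id(
    device_id: int, env_var: str = "CUDA_VISIBLE_DEVICES"
) -> int:
    """Map a logical device index to a physical one (hypothetical sketch)."""
    visible = os.environ.get(env_var)
    if not visible:
        # Assumption: treat an unset or empty variable the same way and
        # return the logical ID unchanged instead of raising.
        return device_id
    # e.g. "2,3" -> ["2", "3"]; logical device 1 is physical device 3.
    device_ids = visible.split(",")
    return int(device_ids[device_id])
```

With `CUDA_VISIBLE_DEVICES="2,3"` the sketch maps logical device 1 to physical device 3; with `CUDA_VISIBLE_DEVICES=""` it returns the logical ID rather than failing on `int("")`.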
Fix FlashMLA detection in ray environment
22fdba14
Move FlashMLA capability check to GPU worker
979129b7
Unit test
5055e5bd
Create helper function _resolve_hardware_dependent_config
9e943ea0
Add V0 support under env flag
b812cd17
Parameterize _get_and_verify_dtype by defer_to_worker
4e4e092b
Change parameter name to 'defer_auto_to_worker'
baadec88
Add _resolve_hardware_dependent_config for V0 Worker
8df72535
Resolve lora dtype
b2fea3c6
Move V1 supported type check into auto resolution's if block
6500644d
Move config resolution to WorkerBase
8c308410
Fix kwargs list
7ff5daea
Inline init_config with init_worker
ceccc0d1
Testing without V0-specific flag
c1912995
Remove whitespace changes
190f9f75
Avoid modifying _get_and_verify_dtype signature for the sake of testing
ab584319
Only check V1 supported dtypes in V1
5f87e531
Move config fixup logic to VllmConfig.resolve_config_with_hardware, f…